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We consider Hermitian and symmetric random band matrices H m d ^ 1 dimensions. The matrix 

elements H^y, indexed by a;,y G A C Z'*, are independent and their variances satisfy a'^y := Ejii'ajyp = 

Xf^ W~ f{{x — y)/W) for some probability density /. We assume that the law of each matrix element H^y 

QQ is symmetric and exhibits subexponential decay. We prove that the time evolution of a quantum particle 

y—{ subject to the Hamiltonian H is diffusive on time scales t <^ W'''^. We also show that the localization 

• length of the eigenvectors of H is larger than a factor W''''^ times the band width W. All results 

^-j are uniform in the size |A| of the matrix. This extends our recent result [1] to general band matrices. 

>— >. As another consequence of our proof we show that, for a larger class of random matrices satisfying 

I ^^ a'^y = 1 for all y, the largest eigenvalue of H is bounded with high probability by 2 + M~^'^'^^ for 

• • any £ > 0, where M := l/(maxa;,j, a^y). 
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1. Introduction 

Wc proved recently [1] that the quantum time evolution e~'*'^/^ generated by a band matrix H with band 
width W is diffusive on time scales t ^ W"^^^, where d = 1, 2, 3, . . . is the number of spatial dimensions. 
As a consequence, we showed that typical eigenvectors are delocalized on a scale at least VK^+'^/s, i.e. the 
localization length is much larger than the band width. A key assumption in [1] was that the matrix entries 
Hxy satisfy 

\H,y\' = ^1(1^ \x-y\^W) , x,yeA, (1.1) 

where A is a large finite box in Z'' and M ~ W^ is a normalization to ensure that ^ \Hxy\'^ = 1. For the 
physical significance of this result in connection with the extended states conjecture for random Schrodinger 
operators, see the introduction of [1], where we also presented an overview of related results and references. 

The goal of this paper is to replace the rather restrictive deterministic condition (1.1) on the matrix 
elements with a natural general class of random variables. We consider symmetric or Hermitian random 
band matrices H = {Hxy) such that EHxy — and the variances a'^y :— EjiJj | are given by cr^ = 
W~'''f(^{x — y)/W), where / is a nonnegativc function satisfying J^jdxf{x) = 1. Thus, / describes the 
shape of a band of width W. The matrix entries are assumed to have an even law with subexponential 
decay. Under these assumptions we show that all results of [1] remain valid. 

The proof of quantum diffusion for general band matrices is considerably more involved than for matrices 
satisfying (1.1). Our proofs are based on an expansion in so-called nonbacktracking powers of H. As observed 
by Feldheim and Sodin [2,5], under the assumption (1.1) these powers satisfy a simple algebraic recursion 
relation which immediately implies that they are given by Chebyshev polynomials in H. In the language of 
perturbativc quantum field theory, the nonbacktracking powers correspond to a self-energy renormalization 
up to all orders. The underlying algebraic identity, however, heavily relies on the special form (1.1). If (1.1) 
does not hold, the renormalization is no longer algebraically exact and the recursion relation becomes much 
more complicated. There are two main reasons for this complication. The first is that the absolute value of 
each matrix element is genuinely random, and hence powers of matrix elements \Hxy\'' cannot be replaced 
by a constant. The second reason is that the variance (t^„ is no longer given by a step function in x — y. 
These two complications give rise to different types of error terms that substantially increase the complexity 
of the Fcynman graphs to be estimated. For instance if, instead of (1.1), we assumed 

aly = E\Hxy\' ^ j^l{l^\x^y\^W), x, y, G A, (1.2) 

i.e. if the band were given by a step function, then our proof would be simpler (in the language of the 
graphical representation of Section 6, we would not have any wiggly lines). 

We remark that some of the additional complications when considering ensembles more general than 
(1.1) have been tackled in [2] and [5]. In particular, Feldheim and Sodin, in Section III of [2], describe how 
to extend their result on the expectation value of traces of Chebyshev polynomials of Wigner matrices from 
(1.1) to more general distributions. In Section 9 of his paper on band matrices [5], Sodin states that the 
procedure of Section III of [2] can be extended to band matrices satisfying the restriction (1.2), but no details 
are given. It seems, however, that a^ being either a fixed constant or zero plays an important role. In this 
paper we consider more general band matrices (assuming less decay of the law of the matrix elements, and 
an arbitrary band shape), and we need to compute squares of matrix elements. Hence the structure of our 
expansion is more involved, and a novel approach is required to control it. 

As a simple consequence of our proof, wc also derive a bound on the largest eigenvalue Amax of a band 
matrix. This result holds in fact for a more general class of random matrices in which the spatial structure 



(and hence the dependence on the spatial dimension d) is absent. The relevant parameter for such matrices 

is 

M 



max^^j,E|iJ2^| ' 

characterizing, very roughly, the number of nontrivial entries in each row of H. It is easy to see that, in the 
special case of d-dimensional band matrices introduced in Section 2, we have M ~ CW^ where W is the 
band width; for a Wigner matrix we have M — N, where N denotes the size of the matrix. We show that 
Amax ^ 2 + M~2/'^+^ with high probability for any e > 0, provided that log A^ ^ Af^^; here c is a constant. 
For a smaller class of band matrices, Sodin [5] previously proved that Amax = 2 + o(l) in distribution, under 
the assumption logA^ <C M. (In fact, for M ^ TV^/^ he computes the asymptotic integrated density of 
states near the spectral edge, and for M ^ N^^^ he even identifies the limiting distribution of the largest 
eigenvalue as the Tracy- Widom distribution.) For other previous results on the largest eigenvalue of random 
band matrices see the references in [5]. In the special case (M — N) of Wigner matrices, similar estimates 
on the largest eigenvalue have been known for some time; we refer to the works of Soshnikov [6] and Vu [9], 
as well as references therein. 

The outline of this paper is as follows. In Section 2 we introduce the model and give the precise definition 
of the class of random band matrices we shall consider. Our main results are stated in Section 3. In Section 4 
we briefly summarize the Chebyshev expansion of the propagator from [1] . In Section 5 we perform a series 
of preliminary truncations using the subexponential decay of the matrix elements. The truncations are in the 
lattice size, the support of the matrix entries, and the tail of the Chebyshev expansion. Section 6 is devoted 
to a derivation of a path expansion for the propagator e^'*^", as well as a graphical scheme for the various 
terms appearing in the expansion. In this graphical representation, the propagator e^'*^" is expressed as a 
sum over graphs which consist of a distinguished path, called the stem, to which are attached trees, called 
boughs. The boughs carry the error terms arising from the non-exact renormalization. In Section 7 we 
take the expectation of our expansion, and describe the resulting lumpings corresponding to higher-order 
cumulants. Section 8 is devoted to the analysis of the bare stem, which yields the main contribution to our 
expansion. The arguments in this section are similar to those of [1], except that we also need to analyse 
higher-order cumulants. Finally, in the most involved part of the paper we show that the contribution of the 
boughs is subleading. For the convenience of the reader, we split the argument into two parts. In Section 9 
we present a simplified proof that is valid up to time scales t < W'^'^ with n < 1/5. Section 10 presents the 
additional arguments needed to reach larger times scales t < W'^'^ with k < 1/3. In the final Section 11 we 
derive a bound on the largest eigenvalue of H. 

We remark that the restriction n < 1/3 needs to be imposed for several different reasons; see the discussion 
in Section 10.1. This restriction is natural and can also be understood as follows. If (i) we do not resum 
terms associated with different n and n' (see (4.7) below), and (ii) we do not make systematic use of detailed 
heat kernel bounds^, then our method must fail for k > 1/3. For otherwise we could prove, as in Section 11, 
that the largest eigenvalue of an A^ x iV Wigner matrix is less than 2 + N^'^/^^^ with high probability; this 
is known to be false. 

Conventions. We use the letters C, c to denote arbitrary positive constants whose values are not important 
and may change from one equation to the next. They may depend on fixed parameters (such as d, /, a, and 
/3 defined below). We use C for large constants and c for small constants. For easy reference, we include a 
list of commonly used symbols and concepts in Appendix E. 



-"^As explained in Section 11 of [1], this involves a refined classification of all skeleton graphs in terms of how much they 
deviate from the 2/3 rule (Lemma 7.7 in [1]). 



Acknowledgements. We are grateful to a referee for suggesting improvements in the presentation as well as 
for pointing out some inaccuracies in a previous version of this manuscript. 



2. The setup 

Let the dimension d ^ 1 he fixed and consider the d-dimensional lattice Z'' equipped with the Euclidean 
norm |-|zci. Wc index points of Z'' with x,y, z, . . . . In order to avoid dealing with the infinite lattice directly, 
we restrict the problem to a finite periodic lattice Apf of linear size N. More precisely, for A^ e N we set 

An := {-[N/2],...,N-l-[N/2]y C Z"^ , 

a cube with side length N centred around the origin. Here [•] denotes integer part. Unless stated otherwise, 
all summations ^^ are understood to mean X^xeA • ^^ work on the Hilbert space £'^(An), and use 11-011 to 
denote the £^-norm oi tp £ i^{AN). We also use || A|| to denote the £^ operator norm of A : (."^{An) -^ i'^{Ais[)- 
For any x & 2/^ denote by [x]^ the unique point in A^r satisfying a; — [xjjv G Nl/ . Define the periodic 
distance on An through 

\x-y\ :== |[a;-y]Ar|^^. 

We consider Herniitian (or symmetric) random band matrices H^ = H whose entries H^y are indexed 
by X, y G Ajy. Here u; G fJ denotes the element of a probability space i7. The entries H^y are always taken 
to be independent random variables, with the obvious restriction that Hyx = H^y- 

Roughly speaking, we shall allow matrices H whose variances 

aly = E\H,y\' 

form a (doubly) stochastic matrix, such that the law of each matrix element H^y is symmetric. 

In order to define H precisely, we need the following definitions. Let A = A^ be a Hermitian matrix 
with independent entries that satisfy EjAj-j,!^ = 1. (Note that wc do not assume identical distribution of 
the entries.) We assume that the law of Axy is symmetric, i.e. that Axy and —Axy have the same law. In 
particular, A may be a real symmetric matrix with symmetric entries. Moreover, we assume that the entries 
Axy have uniformly subexponential decay: There exist a, /3 > 0, independent of x and y, such that 

n\A.y\>0 ^ /5e-«° (2.1) 

for all X, y and ^ ^ 0. In particular, we may consider Gaussian entries. 

In order to describe a band of general shape, we choose some nonnegative continuous^ function / : R"^ — > M 
satisfying J dx f{x) ~ 1 and J dx f{x) Xi = for alH = 1, . . . , rf. We define 

fix) := sup{/(2;) : |y - xK 1} 
and assume that there is a 77 > such that 

dx f{x) \xf+'^+" < 00. (2.2) 

''More generally, it suffices that / be continuous almost everywhere. In particular, / may be a step function. 



We also assume that the covariance matrix S = {^ij)i!^i,j!^d of /, defined by 

Y^ij := / da; f{x) XiXj , (2.3) 



is nonsingular. 

Let W, 1 ^ T4^ ^ A^, be the band width, and define the family of standard deviations (Txy ^ through 



2 _ 2 /w f\ ._ 1 rfi^^ylN 



ai^^ai^iWJ) := _/(^^-^j, (2.4) 

where 

M ^ MiW,NJ) := E/(llr)- (2-5) 

We then define the matrix H through 
We have the asymptotic identity 

^ - 1 (-) 

as VF — > oo, uniformly for all A'' ^ W . In the following we make use of (2.6) without further comment. For 
notational convenience, we use both W and M in tandem. The definition of H immediately implies that 

^E|ff,,|2 = ^< = 1 (2.7) 

y V 

for all X. Moreover, by symmetry of the law of A^y, we have 

Eif^^F™ = (2.8) 

whenever n + to is odd. Finally, we assume that 

N > WM^'^ . (2.9) 

We regard W as the free parameter. 

3. Results 

As in [1], our central quantity is 

Q{t,x) := E|(5,,e-'*^/25o)|', (3.1) 

where t G M and x E A^v- One readily sees that g{t, •) is a probability measure on Z'' for all t G M, i.e. 



J2eit,x) = 1. (3.2) 



The quantity g(t, x) has the interpretation of the probability of finding a quantum particle at the lattice site 
X at time t, provided it started from the origin at time 0. Here the time evolution of the quantum particle 
is governed by the Hamiltonian H. See [1] for more details. 



We consider time scales of order M'^ where k < 1/3. Thus, we set 

t = M^T , 
where T ^ is a quantity of order one. We consider diffusive length scales in x, i.e. distances 

where X is a quantity of order one. 

Our main result generalizes Theorem 3.1 of [1] to the class of band matrices with general distribution 
and covariance introduced in Section 2. 

Theorem 3.1. Let < k < 1/3 he fixed. Then for any Tq > Q and any continuous hounded function 
(p G Cb{'&'^) we have 



uniformly in N ^ W^+'^/^ andOi^T i^To. Here 

L{T,X) := / d\-—^=Gi\T,X) 

Jo TT VI - y 



(3.4) 



is a superposition of heat kernels 



G{T,X) := ' , ^-^x.^-^x 

(27rT)'i/2\/dctS 

where, we recall, S is the covariance matrix (2.3) of the prohahility density f . 

Remark 3.2. The numher A G [0, 1] in (3.4) represents the fraction of the macroscopic time T that the 
particle spends moving effectively; the remaining fraction 1 — A of T represents time the particle "wastes" 
in backtracking. The expression (3.4) gives us an explicit formula for the prohahility density - , ^ 1(0 ^ 
A ^ 1) of the particle moving a fraction A of the total macroscopic time T . See Section 3 of [1] for a more 
detailed discussion. 

Remark 3.3. As a corollary of Theorem 3.1, we get dclocalization of eigenvectors of H on scales W^'^'^'^''^ . 
Indeed, the methods of [1], Section 1 0, imply that the localization length of the eigenvectors of H is with high 
prohahility larger than the hand width times W'^'^''^ . See [1], Theorem 3.3 and Corollary 3.4, for a precise 
statement as well as a proof. 

Our methods also yield a new bound on the largest eigenvalue of a band matrix. This bound is in fact 
valid for a larger class of random matrices, for which the spatial structure and dimensionality arc irrelevant. 

Theorem 3.4. Let the N x N matrix A be as in Section 2, and take a family {o'xyJx y=i '^f variances that 
satisfy (2.7). Define 

M := ^—^ 

maxa;^j, a^y 

and set H^y '■— (JxyAxy ■ Then there is a constant c > such that for any e satisfying < £ < 2/3 we have 
where Amax denotes the largest eigenvalue of H and C^ is a constant depending on e. 



We stress here that the condition (2.9) applies to Theorem 3.1 only, and is not imposed in Theorem 3.4. 
The rest of this paper is devoted to the proof of Theorem 3.1, with the exception of Section 11 which 
contains the proof of Theorem 3.4. 



4. Summary of the Chebyshev expansion from [1] 

For the following, we fix T ^ 0; the claimed uniformity on compacts is a trivial consequence of our analysis 
and we shall not mention it any more. For notational convenience, we often abbreviate 

The starting point of our proof is the same as in [1] , i.e. the Chebyshev expansion of the propagator, 

oo 

e-»f = J2o^n{t)U^{0- (4.1) 

n=0 

Here t/„ denotes the n-th Chebyshev polynomial of the second kind, defined through 

TT I n\ sin(n + \)Q 

[/„ cos6i :== — ■ a ■ 4.2 

smy 

For our purposes it is more convenient to work with the rescaled polynomials Un{^) '■— Un{C/2). They 
satisfy the recursion relation 

UniO - ^U^-liO - Un-2iO (4.3) 

as well as 

The Chebyshev transform a„(i) of the propagator e~'*^ was computed in [1] (see [1], Lemma 5.1), 

n + 1 
an{t) = 2(-i)"^— J„+i(t), 

where J„(t) is the n-th Bessel function of the first kind. We shall need the following basic estimates on 
a„(i); see [1], Equations (5.4) and (7.14). We have the bound 

\anit)\ < ^, (4.4) 

as well as the identity 

n 

for all t G M. A trivial consequence of (4.5) that we shall sometimes need is 

|a„(i)| s^ 1, (4.6) 

for all n and t. 

Using the Chebyshev expansion (4.1) we may write 

g{t,x) ^ J2 a^{t)^^Ai)K[{Un{H))^^{Un'{H))^^]. (4.7) 

The expansion (4.7) is the starting point of our analysis. 



5. Truncations 

We begin the proof of Theorem 3.1 by introducing a series of truncations in the expansion (4.7). First, we 
truncate in the lattice size N by showing that the error we make by assuming A^ ^ W^' is neghgible (see 
(5.2)). Second, we use the subexponential decay of the matrix elements of A to cut off \Axy\ at scales M^ 
for an arbitrary i5 > 0. Third, we introduce a cutoff in the summation over n and n' in (4.7); this will 
prove necessary because the combinatorial estimates for the right-hand side of (4.7) that we shall derive in 
Sections 8-10 deteriorate for very large n and n' . 

5.1. Truncation in N. We replace the matrix H with a truncated matrix H, whereby we truncate in both 
the size of the lattice and the support of the distribution of the matrix entries. Both truncations are made 
possible by the following estimate on the speed of propagation of H. 

Proposition 5.1. Let N = N{W) ~ min(T4^^°''+^^, A^) and introduce the truncated Hamiltonian H defined 

Then there is a constant C > such that, for all t ^ M we have 

p(||e-"^/2<5o-e-'*^/25o||^^) ^ Ce""^", 

where a is the constant from (2.1). 

Proof. See Appendix A. D 

In a first step we truncate the lattice size TV. Defining 

g{t,x) := E\{S,,e-''"/Ho)\\ 
we therefore need to estimate 

for any ip e Cb{M.'^) and t — W'^'^T. Define the diagonal matrix E through 



Then the absolute value of (5.1) is equal to 

sc E||c-'*^/2^o - c-'*^/2^o|| (||Se-'*«/2^o|| + \\Ee-''"/Ho\ 
< CE||e-'*^/2^o-e-"^/2^o||, 

where we used that H and H are Hermitian, and ||_B|| ^ C. Using Proposition 5.1 we therefore conclude 
that (5.1) vanishes a.s W —> oo, uniformly for t ^ M. Note that the matrix a{W,N)H, where a{W,N) := 



— - — ' ~ J, , satisfies (2.7). Since Mmw^oo aiW, N) — 1, is is enough to prove Theorem 3.1 for the matrix 

a{W, N)H (it is straightforward to check that replacing T with a(W, N)T in our proof has no effect). 
We conclude that it is enough to prove Theorem 3.1 for 

N < v^iO'^+ie . (5.2) 

We shall always assume (5.2) from now on. 

5.2. Truncation in |^a;j,|. In a second step we truncate the support of the entries of A. Let S satisfy 

< 12(5 < 1/3 -K (5.3) 

and define the matrix A through 

A^y := A,yl{\A,y\^M^). (5.4) 

In following we adopt the convention that adding a hat (•) to a quantity (•) means that in the definition 
of (•) we replace A with A. In particular, we set 



Hxy --^ (JxyAxy and d{t,x) '■= E|(4,c '*-^/^(5o 
By the uniform subexponential decay of the entries (2.1), we have 

nHccy + H^y) ^ 2P(|A,y| > M^) < 2/3e-^°' . 
Therefore 



F{H^H) s^ J2^{Hxy^Hxy) s^ 2pN^^e-^"\ (5.5) 



x,y 
It is now easy to prove the main result of this subsection. 

Proposition 5.2. We have 



2.\d{tj^) ^ Qi^T^)] ^ Ce 



■M" 



Proof. Using the bound \Q{t,x)\ ^ 1, (5.5), and (5.2) we find 

X 

Note that, by the definition (5.4), the law oi A^y is symmetric. In particular, H satisfies (2.8). Moreover, 
we have the following bounds on the variance of H,j.y. 

Lemma 5.3. There is a constant C independent of x and y such that 



Proof. The upper bound is obvious from (5.4). In order to prove the lower bound, we write 

/•OO 

= aly / dsP(|A,^| > niax(ys,M^)) 
Jo 



which yields the claim. 

5.3. The tail of the expansion. Now wc control the tail of the expansion 

Q(t,x) - e[|(4,c-'*^/2(5o>|' 



^ a„(t)a„,(OE (C/„(i?))oJt/„'(i?))^„ 



a 



(5.6) 



n.n'^O 



As observed in [1], the coefficient an{t) is very small for n ^ t. Thus, we choose a cutoff exponent /j. 
satisfying 

K + iS < fi < 1/3-8(5. (5.7) 

The key ingredient for controlling the tail, i.e. the terms n + n' ^ M^ in (5.6), is the following a priori 
estimate on the norm of H . 

Proposition 5.4. There are constants C, e > 0, depending on S, such that 

f(\\H\\ > CM^^] sC M-^^ 

for M large enough. 

Proof. See Appendix B. n 

Defining 

we find, using Proposition 5.4, that 

^|^(i,x)-^b(i,a;)| ^ N''V(\\H\\;?CM^^) s^ N'^M'^^ ^ CM'" 



M 



Next, write 



9b[ 



it,x) ^ ^ a,,{t)a,At)E\{UniH))^^{Un'iH))^,l(\\H\\^CM'' 



n.n'^0 



(5.8) 



Split gbitjx) — gi,xi^T^)^^h,>{t,x) by splitting the summation over n,n' in (5.8) into the parts n + n' ^ M^ 
and n + n' > M^. 

We now estimate '^x\db.>{W'^'^T,x)\. To this end, we use the following rough estimate on Chebyshev 
polynomials. 



10 



Lemma 5.5. For any n G N and ^ G M we have 

ic/„(oi ^ c"(i + ieir. 

Proof. The recursion relation (4.3) combined with a simple induction argument shows that the coefficients 
of Un are bounded in absolute value by 2". This implies that 



\Unm ^ (n+i)2"(i + i^ir 



and the claim follows. 



n 



Using Lemma 5.5 we therefore get 



^\gb,>{t,x)\ s$ ^ ^ \an{t)an'{t)\ 
^ N^ Yl |a«(i)an'(t)| 



E 



E 



{Un{H))„JU^,{H))^^l(\\H\\^CM' 



r2S 



Ur 



iH)\\\\U^,iH)\\l(\\H\\^ 



CM 



2S 



Now from (4.4) we get 



Therefore 



n-\-n'>M^^ 






nln'l (n + n'y. 



fCW'^'^T 



dKrp\ n+n 



Ei,,>(^-T,.)i < N^ Y. (^; ^ ^— ^""" 



{CM^ 






-cM'' 



(5.9) 



Let us now consider the main term ft,.^(i,a;). In order to get a graph expansion scheme from (2.8), we 
need to get rid of the conditioning on the norm of i?, i.e. recover the expression 



Q^{t,x) := Y. an(i)««'(i)E[(C/„(i?))oJC/„K^)),o 



(5.10) 



n+n'^Af 



Therefore we need to estimate 

X 

< E E W{W'"^T)a^,{W''-T)\ E {Un{H)),SUrAH))M\\H\\>CM' 



X n+n'^Mi^ 



2S 



11 



The expectation is estimated, using Lemma 5.5, by 

Y,^[{UniH))^JU^.{H))^^l[\\H\\ > CM^' 



< C"+"E 



{i + \m 



)"+" l(\\H\\>CM^' 



s$ C 



n+n' ( ]\Td ]\/fS\n+n' 



(N^M'y 



\H\\ > CM 



2<5 



where in the last step we used the trivial bound 



Iff II ^ N'^M^. 



Thus, using (4.6), (5.2), and Proposition 5.4, we find 

'^\gb.,s:{W'^^T,x)-g^{W'^^T,x)\ < Af^^^pf ||i?|| > CM^''') s^ M' 



CM'- 



M 



-eM 



^ M 



-M" 



(5.11) 



as W -^ oo. 

The following proposition summarizes our results from this section. It shows that on time scales t < W'^'^, 
instead of the original density g{x,t) defined in (3.1) it will be sufficient to deal with the density Q^{x,t) of 
the truncated dynamics defined in (5.10). In the rest of the paper wc shall work with ^^(x, t). 

Proposition 5.6. We have 



J2\eiW^''T,x)-g^{W'^''T,x)\ s^ M- 



M" 



for some c > 0, where 'g^ is defined in (5.10). 

Proof. Proposition 5.6 is an immediate consequence of Proposition 5.2 and the equations (5.3), (5.9), and 

(5.11). n 

Note moreover that in the definition (5.10) the sum ranges only over indices n and n' such that n + n' 
is even. This follows from the fact that Un is odd (even) for odd (even) n, and that H satisfies the moment 
condition (2.8). 



6. The path expansion 

In this section we develop a graphical expansion to compute the matrix elements of Un{H) needed to evaluate 
^^(a;,t); see (5.10). The result of this expansion is summarized in Proposition 6.7, which expresses Un{H) as 
a sum over graphs. The main idea is that, thanks to the special properties of the Chebyshev polynomials, we 
can express Un{H) in terms of nonbacktracking powers of H , up to some error terms. The nonbacktracking 
powers make it easier to identify the main terms and the error terms in the computation of the expectation 
in (5.10). The expectation will be computed in Section 7 by introducing an additional structure, the lumping 
of edges, to the graphical representation. Eventually, the main terms will correspond to certain very simple 
graphs with a trivial lumping (ladders) and their contribution yields the final limiting equation (Section 8). 
The contribution of all other nontrivial graphs or nontrivial lumpings will be negligible in the VF — >■ cx) limit; 
the estimate of these error terms constitutes the rest of the paper. 



12 



6.1. Derivation of the expansion. For n G N abbreviate 



Un := Un(H). 

(Note that in (4.1) t/„ = Un{^) denoted the standard Chcbyshev polynomials, but for the rest of the paper 
we shall use f7„ to denote the matrix Un{H).) Thus we have 



as well as 



Uo = 1, Ui ^ H , U2 = H^ -1 



Un = HUn-l - Un-2 (n ^ 2) 



(6.1a) 



(6.1b) 



Next, for n ^ 2 we define Vn as the n-th nonhacktracking power of iJ , i.e. 



[''nJxoXn •— 7 , 



Xi ....,Xn-i 



Y\^ i{xi ^ xi+2) 



4 = 



^Xi)X\^X-iX2 ' ' ' ^Xn-lXn 



We also define Vq '■— 1, Vi '■— H, and t^„ := for n < 0. In order to derive a recursion relation for Vn, we 
define the matrices $2 and $3 through 



^ z ^ z ^ ' 

{^3)xy '■= —\Hxy\ Hxy 1 

where in (6.2a) we used (2.7). Moreover, we introduce the shorthand ^sKi, defined by 



(6.2a) 
(6.2b) 



(M»)-ox„+i :- Y. 



^ l{Xi ^ Xi+2) 



4 = 



\^Z)xqX-i^X\X2 ' ' ' ^X^X^^i J 



we use the convention that ^^Vq = $3. 



Lemma 6.1. We have that 



Fo = 1 , Vi ^ H, V2 = H^ -1-^2 



as well as 



Vn = HVn-l - K-2 - $2^-2 - $3^-3 (n > 2) . 



(6.3) 



Proof. The expressions for Vqj ^1j ^2 are easy to derive from the definition of Vn- Moreover, for n ^ 3 we 
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find 



{HVn-l)xQX„ 


- y. 


"n-2 


f2) 


Hxo 


Ki 


"2:12:2 ' 


••^.„-l 


Xn 






Xi,...,Xn~l 


"n-2 






= >; 


j|l(,X, ^X,+2) 


-'^a;oa;i-'^a;ia;2 ' ' ' ^^n-l^n 




Xi,...,Xn-l 


.i=0 








"n-2 






+ y ] l{xn = X2) 


_[_[ l(xi 7^Xi+2) 


J^X^)X-iJ^XiX2 ' ' ' ^Xn-iXn 




a;i,...,x„_i 


.1=1 








"n-2 






= {V„)xox„ + ^ l(a;o = X2) 


[]l(.T, ^a;,+2) 


-^2:02:1 -Wa;ia;2 ' ' ' ^^n-i^n 




a;i,...,a;„_i 


.1=2 








"n-2 






- y ] l{xn = .X2)l(.Ti = .X3) 


11 l{Xi ^ Xi+2) 


J^X^)X-i-tixiX2 ' ' ' ^Xn-iX 




a;i,...,x„_i 


.4=2 






= (K)xox„ 


^>;i^x„xj' 


Vn- 


-2):eo 


2^71 


+ (*3K-3)xox, 


1 7 





by (6.3). This yields 



HVn-l = y„ + V;_2 + $2^-2 + $3^-3 : 



and the claim follows. 

We may now derive the path expansion of C/„. To streamline notation, it is convenient to define ^2Vn 



n 



Proposition 6.2. We have 



fe>0ae{2,3}'= £o-\ \-ik-n.-\a\ 



(6.4) 



where the sum ranges over ii ^ for i = Q, . . . ,k. Here we use the abbreviation a ~ (ai, . . . , aj.) as well as 
Proof. Define the matrix _D„ through 

It is easy to sec from (6.1) and Lemma 6.1 that 

Do ^ 0, -Di = 0, ^2 = $2, 

as well as 



We prove 



£)„ = HDn-l - D„-2 + $2^-2 + ^3^-3 ■ 
fe^lae{2,3}*= £o + --+ek=n-\a\ 



(6.5) 
(6.6) 



14 



using a simple induction argument. The cases n — 0,1,2 are trivial. Assuming the claim holds up to n — 1, 
we get from (6.5) 

fe>lae{2,3}'» £o + --+ek=n-\a\-l 

-E E E V,,^a^^---^aJ^ 

*:>lae{2,3}'= io + ---+f.k=n-\a\-2 

+ ^2K-2 + ^3^-3 
fc>lae{2,3}'' ^o + --+ffe=n-|a|-2 

-E E E v^^o^oi^^-'-^o^ 

fe>lae{2,3}'= £oH h^fc=n-|a|-2 

+ 52K^+*3K-3+E E E Vl ^a^Vi^ --- K,Ve, 



fc>lae{2,3}'= ^i + ---+4="-|a|-l 

= E E E Vlo+2 ^a,Vi, --- ^a,Vi, 

fc>lae{2,3}'» <?o + --+<?fc=n-|a|-2 

+ E E E {^2Vio+ ^3Vi,-l ) -^a,Vi, --- -^a,Vi, 

fe>lae{2,3}'= £oH h^fc=n-|a|-2 

+ 5214-2+53K-3+E E E ^l$ail^---5aj^, 

fc>lae{2,3}'= ^i + ---+4=«-|a|-l 

where in the last step we used Lemma 6.1. Thus (6.6) is proved. 
Finally, (6.4) is an immediate consequence of (6.6). 



n 



6.2. Graphical representation. The path expansion (6.4) is the key algebraic identity of our proof. Wc now 
introduce a graphical representation of (6.4) by associating a rooted tree graph G with each summand in 
(6.4). 



H^ 



iH%,, 



(Ve). 








— o 
1 






(*3)xoXi 


1 




o 




— o — 

1 


— o — 
2 


— o — 


— o o — 


— o 
6 




o 

(*2)xoxo ■■ O 







1 


2 






6 



(Mi)xo. 



1 



-o 
2 



( ^2Vl )xoxi : O O 

1 



Figure 6.1: The basic graphical units. 

Before giving a precise definition of our graphs, we outline how they arise from (6.4). A matrix clement 
HxQXi is represented by two vertices, and 1. To each vertex v we assign a label Xy G Ajy. Matrix 
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multiplication is represented by concatenating such edges. Thus, H^gxi • • • -ffa;„„ia;„ is represented as a 
sequence of vertices 0, . . . , n joined by n edges. The root is always the leftmost vertex, and the edges are 
directed away from the root. If two neighbouring vertices u,w of a vertex v are constrained to have different 
labels (the nonbacktracking condition), we draw v using a black dot; otherwise, we draw v using a white 
dot. A factor $2 gives rise to a directed edge, represented by a slashed double line, whose final vertex is 
"dangling" in the sense that it has degree one. A factor $3 is represented by a wiggly edge. See Figure 6.1 
for an illustration of these rules. 

Using these graphical building blocks we may conveniently represent any summand of (6.4). See Figure 
6.2 for an example. 




Figure 6.2: Graphical representation of the term V2 $2 ^2 'J&3V2 '^3 ^i <&2Vi $2V2 $3^3 . 

6.3. Definition of graphs. We now give a precise definition of a set of graphs that is sufficiently general for 
our purposes. Let G be a finite, oriented, unlabelled, rooted tree. We denote by V(G) the set of vertices 
of G, by £(G) the set of edges of G, and by a{G) € V(G) the root of G. That G is oriented means that G 
is drawn in the plane, and the edges incident to any vertex are ordered. (Thus, each edge e adjacent to a 
vertex v has a successor, defined as the next edge adjacent to v counting anticlockwise from e.) In particular, 
two graphs are considered different even if they are isomorphic in the usual graph-theoretical sense but the 
ordering of the edges at some vertex differs. This notion of orientation can be formalized using Dick paths 
(see e.g. [3], Chapter 1). Such a formal definition is not necessary for our purposes however. 

The choice of a root a{G) implies that we may view G as a directed graph, whereby edges are directed 
away from the root. Thus we shall always regard an edge e — (v, w) as an ordered pair of vertices. Given an 
edge e = (y, w) G £{G), we denote by a(e) = v the initial vertex of e and by &(e) = w the final vertex of e. 

There is a natural notion of distance between vertices: For w, w e V(G) we set d{v, w) to be equal to the 
number of edges in the shortest path from v to w. Each vertex v 7^ a(G) has a parent w, defined as the 
unique vertex adjacent to v and satisfying d{a{G),w) — d{a{G),v) — 1. If w is the parent of v we also say 
that i; is a child of w. Similarly, if an edge e is not incident to a{G), we call the (unique) edge e' satisfying 
a(e) = fo(e') the parent of e; in this case we also call e a child of e'. 

We require that G have an additional distinguished vertex b{G) G V(G), which need not be different 
from a{G). The path connecting a{G) to b{G) is called the stem of G, and denoted by S(G). When drawing 
G in the plane, we draw the stem as a horizontal path from a{G) at its left edge to b{G) at its right edge. 
We require that all edges not belonging to the stem lie above it (see Figure 6.3). Ultimately, the vertices 
a{G) and 5(G) will receive the fixed labels Xa(G) = ^ ^md xi,(^g) — J/ iii ^^^ graphical expansion of the matrix 
element {Un)xy 

We denote the set of such graphs by 211. We call an edge e G £{G) a stem edge if it belongs to £{S{G)), 
and a bough edge otherwise. If G has no bough edges, we call it a bare stem. A bare stem is uniquely 
determined by its number of edges. 

Thus, a graph G G SU consists of a stem and a collection of rooted trees, called boughs. Each bough is 
directed away from its root vertex, which belongs to the stem S{G). We abbreviate with B{G) the subgraph 
of G consisting of all bough edges. We call a bough edge e G £{B{G)) — £{G) \ £{S{G)) a leaf if 6(e) has 
degree one. See Figure 6.3 for an example of a graph in 211. 
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o — « — o- 








a{G) 



-o — > — 6 

KG) 



Figure 6.3: A graph in 21J. 



Next, we decorate graphs G G SU as follows. First, we tag the edges, i.e. we choose a map tg on £{G), 
called a tagging, with values in the set of tags 



{(s,0),(s,l),(6,0),(6,l),(5,2),(6,3),(6,4)} 



(6.7) 



Here s stands for "stem" and b for "bough" . We require that the tag Taie) be of the form (s, z) if e G £{S{G)) 
and of the form {b, i) otherwise. The index i (taking values in {0, 1} for stem edges and {0, . . . , 4} for bough 
edges) is used to tag different types of edges. Edges whose tag is (s, 0) or (6, 0) are called large; other edges are 
called small. The reason for this nomenclature lies in the magnitude of their contribution to the value of the 
graph after taking the expectation; see Section 9. Second, we choose a symmetric map Iq '■ V(G)^ -^ {0, 1} 
which will be used to encode all nonbacktracking conditions on G. The idea is that Ig{v,w) — 1 induces 
a constraint Xy ^ x^ on the labels. We require that l{v,w) — unless d{v,w) — 2. We call the triple 
(G, tq^Iq) a decorated graph, and denote the set of decorated graphs by ©. 

Next, we associate a value ^xy{Q) with each decorated graph ^ € ©. The value '^xy{G) is a random 
variable that depends on two labels x,y G A^r. For the following we fix ^ = {G,tc,Ig)- We shall assign 
a label Xy G Ajv to each vertex v G V(G) in such a way that x ~ Xa(G) ^-nd y = Xftf^)- To define ^xyiG) 
we first assign a polynomial in the matrix entries to each edge. Let e G £{G) and abbreviate xq = Xa(e) 

and xi = xi,(e)- We associate a polynomial PT^(e){Hx„xi, Hx-^xa), and a degree deg^^(e) = deg(e), with e 
according to the following table. 



rcie) 


-< TG(e)(-"2:o2;i , J^xixo) 


deg(e) 


(,s,0) 


HxQXi 


1 


(s,l) 


^\-tixaXi\ J^xoxi 


3 


(5,0) 


\-n-xoxi \ 


2 


(M) 


l-Wxoxi 1 — '^xaxx 


2 


(6,2), (6,3) 


\-n-xaxi \ 


4 


(^4) 


\-n-xoxi \ 


6 



Note that deg(e) is nothing but the degree of the polynomial P^^(^^y The degree of Q is 

deg(e^) := V deg(e). 



E 

ee£{G) 



(6.8) 



17 



In order to define ^xy{G) it is convenient to abbreviate the family of labels by x = [Xy 
Then we set 



"OxyiG) 



E^-(0.".^MO 



"l/aji, 



Yi (1-'g(w,w)4„x, 



.t),tueV(G) 



n 

.ee£(G) 



P- 



rcie) 



\-"-Xa{e)Xb 



,H 



Xb(e)3:alc 



GV(G)). 



(6.9) 



The summation over x means unrestricted summation for all x^ G Ajy, v G V(G). 

We call a stem vertex v G V{S{G)) \ {a{G),b{G)} nonbacktracking if the two stem edges adjacent 
to V, {u,v) and (v,w), satisfy Zg(u,w) — 1; according to (6.9), this means that we have the constraint 
2^11 7^ x-uj- Otherwise we call v backtracking. We call the stem S{G) completely nonbacktracing if all vertices 
in V{S{G))\{a{G),b{G)} are nonbacktracking. Decorated graphs (G, Tc^ la) G are represented graphically 
as follows. Each edge of G is drawn using a decoration that identifies its tag TG(e); see Figure 6.4. (Note 
that, although Figure 6.4 suggests that decorated bough edges are double, they are in fact single. This 
graphical representation using double lines is chosen in the light of the graph operations J>j, Tc, and TZ 
defined below.) Non-backtracking stem vertices are drawn with a black dot; other vertices arc drawn with 
a white dot. Note that using black and white dots to draw the vertices displays only partial information 
about Iq'- Only nonbacktracking restrictions pertaining to pairs of vertices both in the stem arc indicated in 
our graphical representation. See Figure 6.5 for an example of a decorated graph. 



(.,0) 



(5,1) 



O 

(fe,0) 



o 

(6,1) 



(6,2) 



(6,3) 



(6,4) 



Figure 6.4: The edge decorations along with their associated tags. 




Figure 6.5: A decorated graph in 85. 



6.4. Operations on graphs. As it turns out, in order to control the graph expansion we shall have to make 
all stem vertices apart from a{G) and 6(G) nonbacktracking. To this end, we introduce two operations, 
Fn and J-"c, on the set of decorated graphs ©. We shall prove that after a finite number of successive 
applications of either Fn or Fc to an arbitrary decorated graph, we always get a graph with a completely 
nonbacktracking stem. The index n stands for "nonbacktracking" and c for "collapsing" . The idea behind 
the definition of J-"„ and J^c is to choose the first (in the natural order of S{G)) backtracking stem vertex 
vi G V{S{G)) \ {a(G),6(G)} and introduce a splitting in the definition (6.9) using 



1 = 1(2^1,0 ^2:^2) + 1(^1 
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where the vertices vq, V2 G V{S{G)) are the neighbours of vi in the stem, i.e. they satisfy {vq, wi), {vi, V2) G 
S{SiG)). 

We now define Tn and J^c more precisely. If S{G) has no backtracking vertex, set Tn{G) '■= Q and 
Fc{Q) '■= 0, where is the empty graph satisfying '^xy{^) '■= 0. 

Otherwise, let t^i be the first backtracking vertex in V(5(G))\{a(G'), 6(G)} and define vq and V2 as above. 
Then we set Tn{Q) ■— (G, tq, Iq), where 

lGiv,w) := Ig{v,w) + Sy^gSyjy^ + SiovoSvv2 ■ 

Thus, the operation J>i simply makes the vertex vi a nonbacktracking vertex of 5(G) without changing G 
or tg, i.e. it sets ?g(wo, 1^2) — ^g(^2, ^o) = 1 and leaves Ig unchanged for any other pair of vertices. 

Next, we define J\,. Let Wg, f 1, i'2 be as above. The operation Tc collapses the two nearest stem neighbours, 
Vq and V2, of vi into one vertex and fuses the two edges {vq,vi) and (wi,'y2) into one edge (see Figure 6.6). 
This definition is very natural in the light of Figure 6.6 and our choice of conventions for drawing bough 
edges as double lines. Thus, a reader who believes his eyes when gazing at pictures like Figure 6.6 may safely 
skip the following two paragraphs. 

To define the operation J-"c precisely, we identify vq with V2, i.e. introduce the equivalence classes 

\v] - l^^^ ^^^ ^ {^'0,^'2} 

Define the graph G through its vertex set V(G) = {[v] : v € V(G)}, and its edge set, which is obtained as 
follows. Each edge {v,w) € S{G) \{{vi,V2)} gives rise to the edge {[v], [w]) € £{G)- Thus, the edges (wqj^'i) 
and {vx,V2) are fused into a single edge {[vq], [vi]). The tag Tg(([w], [w])) is by definition equal to the tag 
tg((i;,w)) if (w, w) 7^ {vq^vi)] the tag of the edge ([wo], [fi]) is defined by the following table. 



tg{{vo,vi)) 


TG 


((^1,^2)) 


Tg((M,M)) 


{s,0) 




(5,0) 


(6,0) 


(s,0) 




(5, 1) 


(^2) 


(5,1) 




(,S,0) 


(5,3) 


(5,1) 




(5, 1) 


(^4) 



The initial and final vertices of G are given by a{G) := [a(G)] and 5(G) := [b{G)]. The edges of G are 
oriented in the natural way when drawing G and G in the plane; instead of giving a formal definition of the 
orientation, we refer to Figure 6.6. 

Finally, we define the map /g, which encodes the nonbacktracking information of G, through 

, ,_ _, I 1 IgIv, w) — 1 for some pair of representatives v £v,w £ w 

l^{v,w) := i . 

I otherwise . 

Thus, in the graphical representation of J^c{G) '■— {G,t^,1^), the vertex [vq] = [V2] is always white (i.e. 
backtracking). Note that if vq or V2 was nonbacktracking, this restriction remains encoded in the map Iq, 
but is no longer visible in the colouring of the vertices. 

We summarize the key properties of J>i and J^^, which follow immediately from their construction. 
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V2 
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Vq V2 




Figure 6.6: Graphical representation of the operations J>i and J^c- 

Lemma 6.3. Let gee. Then Tn{g),Tc{g) G 0. Moreover, 

"SxyiG) - 23,j,(J-„(g))+23,^(J-c(g)), 

and 

deg{T„{g)) = deg{T,{g)) = dcg(g) . 

6.5. Graphs with completely nonbacktracking stem. Next, we introduce two special subsets of decorated 
graphs. We define ©' C © to be the set of decorated graphs corresponding to terms in (6.4). See Figure 6.2 
for an example. More precisely: 

Definition 6.4. The set ©' is the subset of {G,tg,Ig) G ® satisfying 

(i) All houghs of G contain only one edge, whose tag is (6, 1); 

(ii) Ig{u, w) ~ \ if and only if there is a vertex v that is not the root of a bough, such that {u, v), {v, w) G 
£{S{G)) withTG{{v,w)) = (s,0). 

Property (ii) says that all bough vertices (including the bough roots) are white, and that the left vertex 
of a wiggly edge is white. The remaining vertices (apart from a{G) and b{G)) are black. It is easy to see 
that the graphs associated with terms on the right-hand side of (6.4) belong to C5' 

Note that, unlike in the case of a general graph Q E &, the nonbacktracking information of a graph Q E &' 
is fully encoded in the colouring of its vertices. Indeed, Ig{v, w) can only he 1 ii v,w € V{S{G)). Moreover, 
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from (ii) we see that Ig is uniquely determined by G and tq- Thus, a decorated graph Q = {G,tq,Ig) G 25' 
is uniquely determined by its graph and tagging, i.e. the pair (G,tg)- 

The second important subset of decorated graphs is generated from ©' by applying the operations J>i, J^c 
to decorated graphs in 25' until the stem is completely nonbacktracking, i.e. all stem vertices (apart from 
a{G) and h{G)) are black. 

Definition 6.5. For Q Cz & we define .^g as the set of decorated graphs Q Cz & whose stem is completely 
nonbacktracking and that are obtained from Q by a finite number of operations J-n and Tc- Furthermore we 
set 

©J := U ^6- 

The set 25(j is the set of "good" graphs that we shall work with in later sections. Thus, given a graph 
Q ^ & corresponding to a summand of (6.4), we first transform it into the family SSg of graphs in ©j. The 
contribution of Q to the expansion (6.4) is given by the sum of the contributions of all graphs in S§g (see 
(6.10) below). We then exploit the fact that we have good estimates on the contributions of graphs with 
completely nonbacktracking stems. 

Next, we state and prove the key properties of the set ©j ^'^'^ the operations Tn and Tc- 

Proposition 6.6. (i) If Q = {G,tq,Iq) G ©j then Iq is uniquely determined by the pair {G,tg) alone. 
In other words, there is a function £ such that Iq — i{G, tq) for all (G, tq, Iq) G ©jj. 

(ii) If {G,tq,Iq) G ©j then all leaves of G are small (in Tq). 

(Hi) If (G, Tq, Iq) G ©tt and e G S{G) has tag TQ{e) — {b, 1), then e is a leaf of G. 

(iv) ifg^g'e& then .%g n ^g^ 



(v) For any tj G ©' and Q G ^g we have dcg{Q) ~ dcg(Q). 



(vi) For each Q £ 0' we have 

2Ix,(a) - J2 "^-vi^)- (6-10) 

Gesgg 

Proof. The key ingredient of the proof is the following ripping operation, denoted by TZ. It provides a link 
between the sets ©jj and ©', and is essentially the converse of multiple applications of Fn and Fc- The idea 
is to take hold of the vertices a{G) and 6(G) of a given tagged graph {G,tq) and "pull them apart", thus 
"ripping open" all bough edges of Q except those of type (b, 1). When interpreted graphically, the character 
of each edge (straight or wiggly) is kept unchanged, whereby the double edge of a bough edge is split into 
two single edges. 

When defining TZ it is convenient, in a first step, to "rip open" all bough edges (including those of type 
(6, 1)) of {G,tq); we shall call the resulting tagged graph V{G,tg). In a second step, we undo the ripping 
of all bough edges of type {b, 1), which results in the tagged graph TZ{G, tq). 

In order to define V, we need one additional tag (s, 2) for stem edges, which we draw with a single solid 
line that is slashed. Stem edges of type (s,2) result from the ripping open of a bough edge of type (6, 1). 
By walking around G, we associate with the tagged graph {G,tq) a tagged bare stem {G,tq) =: 'P{G,tq). 
More precisely, we draw G in the plane, and start at the vertex a{G). At each step, we move along one edge 
of G in such a way that we always remain to the left of G; see Figure 6.7. Every stem edge is travelled once, 
and every bough edge twice. Each time we move along an edge e G £{G), we add an edge e to the stem G. 
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Depending on whether we moved along e in the direction of e (denoted by +) or against the direction of e 
(denoted by — ), we associate a tag Tg(e) with e according to the foUowing table. 

TG'(e) direction '''^^(e) 



(s,0) 

(6,0) 

(&,2) 
(6,2) 
(6,3) 
(6,3) 



± 
± 



± 



(s,0) 

(s,l) 
(s,0) 

(s,0) 
(s,0) 



Graphical representation of (s, 2): o — i — O 




Q) ;.o) 



o a b' a 




6(G) 



Figure 6.7: The walk around G. 

These rules are made obvious by a glance at Figure 6.4; indeed, a tagged bough edge is represented with a 
double line which corresponds exactly to the two single lines resulting from ripping the bough edge open. 
Figure 6.9 provides an example of the operation {G,tg) i-> V{G,tg). The map V can also be interpreted 
as first doubling all bough edges according to their tags, and ripping them open successively by pulling the 
edges a{G) and 6(G) apart; see Figure 6.8. 

We now define TZ{G,tq) — Q' to be the unique decorated graph Q' = {G' ,tq/,Iq/) £ &' that satisfies 
V{G,Ta) — P{G' ,tg'); see Figure 6.9. That there is exactly one such Q' E &' follows immediately from the 
definitions of V and ©', as well as the fact that Iq' is uniquely determined by the pair {G',tg') through 
Definition 6.4 (ii). (Thus, the operation V plays only an auxiliary role, its sole purpose being to clarify the 
definition of TZ.) 

Having defined the ripping operation TZ, we are now ready to prove Claim (i) of the Proposition. Before 
giving the full proof we outline the strategy. First, for any Q ^ (G, tq, Ig) G ®tt ^'^ construct the ripped graph 
TZ{G,tg) € ©', which does not depend on Ig- Second, by definition of ©', the ripped graph Q' = TZ{G,tg) 
bears a unique nonbacktracing map Ig' ■ Third, by definition of ©j, there is a sequence ii,. . . ,ik € {n,c} 
such that Q = (G, rg, Ig) = (J^ik ° ■ ■ ■ ° J^i-^){'JZ{G, tg)). Fourth, we prove that this representation is unique. 
Thus we have expressed Ig as a function of {G,tg)- 
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-(y\y\j^\^-\-0 




■0-^./^./'^./^0 



H^ o O'\^^''^^^o qn^^-'v/^o 0'\^^-~'^o I — > o cr^^^-'^^^o- 




-Qn/n^'^/vO 



-O-v/v/'n^nO o-N^^/x/xO 1 O 1 O 0'x^^--\^MD 



Figure 6.8: The dynamical process of successively ripping open doubled bough edges. Note that the stem edges with 
tag (s, 2) (drawn with a slashed single line) always occur in consecutive pairs. 
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V{G,tg) 



-t O 1 O O 1 O 1 O'xyx./x.^MD O 







0-X^N>'N/V.« O 



niCra) 



Figure 6.9: The definitions of V and 7?.. 

Now to the proof of (i). Let G = {G,tg,Ig) G 0j. By definition of 0[t, there is a decorated graph 
Q' = (G", tg',Ig') G 25' and a finite sequence ii, . . . ,ik G {n, c} such that 

g = (J-.,o...oj-,j(g'). (6.11) 

We now claim that both Q' and the sequence zi, . . . , i^, are uniquely determined by (G, tq) (under the obvious 
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constraint that no Ti is allowed to act on a decorated graph whose stem is completely nonbacktracking) . 
Indeed, we must have that Q' = TZ{G,tg). (This follows immediately from the fact that TZ is left invariant 
under the action of J"i, i e {n,c}; i.e. 7^(G2,tgJ = 7?.(Gi,tgJ for any (G2,tg2, ^Ca) = -^i(Gi, tgi, /gJ 
where (Gi,tgi,;gi) e 0.) 

That different (under the above constraint) sequences applied to TZ{G,tg) yield a different tagged graph 
is an immediate consequence of the following general claim. In order to state it, we introduce the set !^^ 

as the set of decorated graphs obtained from ^ € (5 by a arbitrary applications of the operations J"„, J^c- 
(The set !^q will be used in the statement and the proof of the following Claim (*). We remark that the 
previously defined set ^g is a subset of !^g with the additional requirement that the stem is black.) 

(*) Let G — (G, Tq, /g) G © be an arbitrary decorated graph whose stem is not completely nonbacktracking. 
Then for any Qi = (Gi,rGi,/Gi) e ^^^(g) and 02 = (G2,tg2,/g2) ^ ^^^(g) we have Gi 7^ G2. 

Claim (*) will be used in the following situation. We shall apply sequences of operations J>i and J^, to a 
decorated graph Q' € ©'. If two sequences of such operations differ from each other in at least one step, then 
the resulting two graphs will be different. In other words, if a decorated graph Q can be written in the form 
(6.11) and G' is known, then the sequence zi, ^2, . . . , ifc is uniquely determined. Together with the uniqueness 
of Q' = TZ{G,tg) established earlier, this proves the uniqueness of the representation (6.11). Thus we can 
define the map i : {G,tg) — >■ Iq through 

g = (g^tgJg) = (J-,, o...oJ■,J(7^(G,TG)), 

and hence Claim (i) follows. 

We now prove Claim (*). For any graph G G 2U and integer g e Qq := {0, 1, . . . , |£:(5(G))| +2\£{B{G))\}, 
we define the vertex ^^(f?) € V(G) as the vertex reached after q steps of the walk around G (see Figure 6.7). 
For q G Qq we define the "time of next return" rr,{q) as the smallest integer q' > q such that vpi{q') ~ v^(q); 
if there is no such g', we set q' := 00. 

Next, let vi e V(G) \ {a{G),b{G)} be the first backtracking stem vertex of G, and denote by vq its 
parent vertex (for an example see Figure 6.6). Define go as the "last time we walk across wq", i.e. as 
the largest integer in Q^ satisfying wg((7o) = "^o- By definition of qo, we have r^{q{)) = 00. Now define 
Qc = {Gc,tg^,Igc) •= J^ciG)- Clearly, we have that rG^{qo) < 00. Moreover, one readily sees that 

rG2{qQ) = rG^qo) < rGiiQo), (6.12) 

for aU Qi = (Gi,rGi,/Gi) <= ^jr rg) and Q2 ~ (G2,tg2,^G2) ^ ^jr (g)- The equality expresses the fact that 
Vq and V2 have already been collapsed into one vertex in all Q2 G !^jr (p)- The inequality expresses the fact 
that, while vq may be collapsed with a stem vertex Vj at some point when constructing Gi G ^jr /cv ^^^ 
walk from vq to Vj is strictly longer than from vq to V2. Claim (*) follows immediately from (6.12). 

Next, we prove Claim (ii). li Q G &' then by definition all leaf edges have tag (6, 1), i.e. are small. This 
also holds for Tn{G) (trivially), as well as for J^dG)- In order to see this, define the property (Pg) as follows. 

(Pg) If a vertex v G V(G) that is not the root of a bough satisfies {u,v), {v,w) G £{S{G)) for some vertices 
v,w and if the tags of {u,v) and {v,w) are both (s,0), then the vertex w is a nonbacktracking stem 
vertex. 

Property (Pg ) for a decorated graph Q means that a vertex between two straight stem edges is black unless 
it is the root of a bough. It is easy to see that the property (Pg) satisfied for all Q E &' (see Definition 6.4 
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(ii)). Moreover, (Pg) is invariant under Tc and Tn- Recalling the definition of 0j, we see that Claim (ii) 
follows by induction. 

Next, Claim (iii) clearly holds xiQ = {G,tq,Iq) € ©'. Moreover, by definition of J>i and 7^, Claim (iii) 
holds for Fn{G) and J^c{G) if it holds for Q. Hence Claim (iii) follows from the definition of ©j. 

Claim (iv) is an immediate consequence of the fact that if tj g ,^g then Q = Tl(G)- 

Claim (v) is an immediate consequence of the fact that, by definition of J>i and Tc, we have deg(Q) = 
deg(J-„(^)) = deg{T,{g)). 

Finally, we prove Claim (vi). Let Q E &' . Using Lemma 6.3 repeatedly, we get 

9J.y(a) - ^.y{Mg))+^.y{T,{g)) 

= "XJ^yiTniTniG))) +'i}.y{M^n{g))) +'S.v{Tn{M5))) +'S.v{MM5))) 

= ... = ^2J,,(g,), 
iei 

where {Qi)i<:i is a finite family of decorated graphs whose stems are completely nonbacktracking. By defin- 
ition of ^g, we have -SSg = {Qi : i € I}. What remains is to show that each Q e ^g appears only 
once in {Gi)i^i. But this is an immediate consequence of the uniqueness of the sequence ii,. . . ,ik in the 
representation Q = {Ti^ o • • • o Fi^^{Q)] sec the proof of Claim (i) above. D 

In view of Proposition 6.6 (i), we may regard the set ©j as a set of tagged graphs [G^tq)- Wc shall 
consistently adopt this point of view from now on. 



Proposition 6.7. We h 



ave 



{Un).y - Y. ®-S'(^) = E ^^-viS), (6.13) 

gee' :dcg{g)=n ee«„ 

where ^xy{G) is defined in (6.9), and we defined the subset of graphs 

&n ■■= {Ge&i : dcg{g) = n} . 

Proof. The first equality of (6.13) follows from (6.4) and the definition of ©' (see Definition 6.4); the second 
from Proposition 6.6 (iv), (v), and (vi). D 

7. Lumping of edges 

Recall that our aim is to compute 

g^{t,x) = ^ an{t)an'{t)E{Un)ax{Un')xO- 



By (6.13) we have 



Q^{t,x) = Yl anit)an'{t) J2 J2 E23o.(g)QJ,o(a')- (7.1) 

n+n'!^Mi^ Ge&n G'e&„> 



Computing the expectation E5Jox(G)23a;o(G") yields a lumping of the edges £{G) U £{G'), which we now 
describe. 
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For the following wc HxQ ~ (G, tq) G ©j and Q' ~ (C, tq') G ©tt- Thus, wc also fix the maps Iq and Ig'', 
see Proposition 6.6 (i). To streamline notation, we introduce their union QUQ' = {GU G' , tguG') defined in 
the obvious way. We also get the map IguG' that we extend by requiring that ^guG'(^i^) = if w € V(G') 
and w e V(G"). We often abbreviate t = tguG' and I = IguG'- 

As in the previous section, we abbreviate the family of labels with 



{xy : veV{GUG')) 



From (6.9) we immediately get 



^OxiG)'^xoiQ') = /Jox^fa^S^x^^^^6x 



:.(G') "^HG') 



.v,weV(GUG') 

^ 11 -^-r(e)(-^2:a(e)a;b(e)''^a;5(e)a;„(e)) • (7-2) 

ee£{GUG') 

Next, for any fixed x we assign to each edge e e £{G U G') the unordered pair of labels 

To each label configuration x we assign a lumping r(x) of the edges £{G U G') according to the value of 
the map gx- We use the word lumping to mean an equivalence relation on £{G U G"), or, equivalently, a 
partition of £{G U G'). More precisely, the lumping r(x) is defined as the equivalence relation (denoted by 
^) on £{G U G') such that e ^ e' if and only if Qxie) = ex(e')- We use the notation F = {7}-ygr, where 
7 C £{G U G') is a lump, i.e. an equivalence class. Thus, taking the expectation in (7.2) yields 



^^ox{G)'Sxo{G') = '^Sox,^G)^^^HGA^a^G')^ox,^^,^ 



Y[ {l''l(v,w)6x,x^ 

.v,weV{GUG') 

X n En^-('=)(^-.(e,-H=,'^-M.,-.(=,)' (7-3) 



7er(x) eet 

where we used that i?a(e)6(e) and i?a(e')&(e') are independent if gx{e) 7^ £'x(e') 
Next, we define the indicator function 



Ar(x) := l(F(x)=r) = 



n n n Hs^e) ^ g.{e')) 

.7#7'er ee7 e'e7' 



n n l(ex(e) = ex(e')) 



(7.4) 



indicating that a labelling x is compatible with the equivalence relation F, i.e. Px(e) = £'x(e') if and only if 
e-e'. 

By definition, ^T(e) is an even function whenever deg(e) is even and an odd function whenever deg(e) is 
odd. Moreover, the matrix elements of H were truncated in such a way that the identity (2.8) remains valid 
for H; see (5.4). Thus, the expectation (7.3) vanishes unless all lumps 7 e F(x) are of even degree, whereby 
the degree of a lump 7 is defined as 

deg(7) — XI'^'^S(^)- 
667 
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Let ^(G U G") denote the set of all lumpings of £{G U G') whose lumps are of even degree. Thus (7.3) 
becomes 

E9Jox(a)23.o(a') = Y. K(aug',r), (7.5) 

re^(GuG') 

where we defined the value of the graph Q yjQ' with lumping F as 



14(gug',r) := ^Ar(x)5ox„^^,4.,(^,4.„(^,,^. 



Lv,weV{GUG') 

X n E n P,(,) (i?.„<,,.,,,, , i?.M.,.„(., ) • (7.6) 
7er ee-f 

Next, let /„ e Sn denote the bare stem consisting of n edges. Recall that a bare stem is a graph with 
no bough edges; it is uniquely determined by its number of edges. Denote by X„ € &„ the decorated 
graph obtained from /„ by assigning the tag (s,0) to each edge (in particular, the stem /„ is completely 
nonbacktracking in Z„). Define the subset 

©: :- ©„\{X„}. 

From (7.1) and (7.5) we get the splitting 

= Y, an{t)an'{t) Y V^.(2:„Ul„sF)+ ^ a„{t)an'{t) Y Y. Y. ^AG^S'X) 

+ J2 o,n{t)an'{t) J2 J2 V;(X„ug',r)+ J2 (^n{t)an'{t) ^ ^ T4(guX„,,F). 

n+n't^M'' e'ee*, re«'(/„uG') n+n's^M'^ ee«* rea'(Gu/„/) 

(7.7) 

This is our starting point for the remaining sections. The first term on the right-hand side of (7.7) is the 
leading term, whose contribution is computed in Section 8. The remaining three terms on the right-hand 
side of (7.7) are error terms, and are estimated in Sections 9 and 10. 

8. The bare stem 

In this section we analyse the first term on the right-hand side of (7.7) by proving the following result. 
Proposition 8.1. For any continuous bounded function cp G Cb(M.'^) and T ^ we have 



^ AXL{T,X)if{X), (8.1) 
where we recall the definition of L{T, X) from (3.4). 
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The rest of this section is devoted to the proof of Proposition 8.1. The proof is similar to [1], which 
we shall frequently refer to in this section for precise definitions and proofs. We therefore assume that the 
reader has some familiarity with [1]. 

The only complication compared to [1] is that controlling higher order lumpings (resulting in high mo- 
ments of Axy) requires more effort, since, unlike in [1], the matrix elements of A are not bounded by 1 (but 
only by M^). A lump 7 containing I7I edges carries a weight M^^'^^, but this factor can be compensated by 
the fact that large lumps impose strong restrictions on the labelling of the vertices. Technically, we shall 
deal with these higher order lumpings by replacing an arbitrary lumping with a pairing whose contribution 
is small enough to compensate any powers of M resulting from the lumping. In this way we can directly 
reduce the estimate of general lumpings to pairings. The appropriate pairing will be selected by a greedy 
algorithm defined in Appendix C. 

We begin by establishing notation and recalling the relevant results from [1]. 

8.1. Pairing of edges. The simple structure oiXnUXn> allows for some notational simplifications. Following 
[1], we abbreviate ^„_„/ := ^(/„ U /„') and V;,(r) := V^{In Uln',T). Thus the left-hand side of (8.1) 
becomes 



lim y (fi 



^rl+dK/2 



E 



a^{W''^T)a^\W<i-T) 



E 



K(r). 



(8.2) 



Tei 



As in [1], we identify the vertices a(/„) and 6(/„'), as well as the vertices 6(/„) and a(/„') (this is purely a 
notational simplification). We label the vertices explicitly according to 



v(/„u/„o = {0, 



v-1}, 



a(/„) = &(/„') = 0, 6(/„) = a(/„/) = n, 



and write x = (xq, . . . ,Xn+n'-i)- See Figure 8.1. Recall that the degree of every edge of /„ U /„' is odd. 
Since, by definition of '^n,n' ^ every lump 7 G P has even degree, we conclude that every lump 7 G F has an 
even number of edges. (Note that no such statement is possible for a general lump that also contains bough 
edges. Indeed, bough edges have even degree, so that the total degree of the lump gives no information 
about the its number of edges.) 



a{In) 



b{I„ 



b{In 



a{In 




Figure 8.1: Identifying the end vertices of /„ and /„/. 
The expression (7.6) may also be simplified in the case of the bare stem. From (7.6) we get 

I4(P) = ^Ar(x)0,(x)nEn^-»(=)-Ke,' 



(8.3) 



where we defined the indicator function 



c(x) := 5oKo<5a: 



Y]^ i{xi ^ xi+2) 



~n-\-n —2 



Yl^ l(Xi ^ Xj+2) 



.4=0 
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Here we used that all edges of Z„ U 2„/ have tag (s, 0). 

Next, we make the obvious observation that, without loss of generality, we may exclude from ^„.„' all 
lumpings F satisfying Ar(x)(52,(x) = for all x and x. In particular, if 7 S F then 7 cannot contain two 
adjacent edges (since this would contradict the nonbacktracking condition in Q). 

We call lumpings F = {7} with I7I = 2 for each 7 G F pairings, and denote the subset of pairings by 
S^n.n' C ^n,n' • We shall oftcn use the notation H ~ {tt} instead of F = {7} to denote a pairing. We represent 
a pair tt — {e, e'} graphically by drawing a line, called a bridge, that joins the edges e, e' € E(In U /„'); see 
Figure 8.2. 

We shall show that the leading order contribution to the left-hand side of (8.1) comes from the pairings; 
all higher order lumpings are subleading. Moreover, only the contribution of the so-called ladder pairing 
(see Subsection 8.3 below) survives in the limit VK ^ 00. In fact, only the ladder whose bridges all carry a 
straight tag (see below for the definition of the tagging of bridges) yields a nonvanishing contribution to the 
left-hand side of (8.1). 




n + n' — 1 




If He 



Figure 8.2: A general pairing (left) and a ladder (right), 
is a pairing we get from (7.4) 



An(x) = 



n n n i(^x(e)^ex(e')) 



n l(gx(e) = gx(e')) 

J L{e,e'}en 



(8.4) 



At this point we stress that the indicator function l(^gx{e) — g^ie')) in (8.4) associated with the bridge 
{e, e'} is different from its counterpart in [1] (Equation (6.3) in [1]), where bridges carry an orientation. In 
order to make the link to [1], we tag^ bridges (similarly to Section 9 of [1]). In other words, we choose a 
map f? : n ^- {0, 1} and replace the factor l(px(e) = gix(e')) in (8.4) with Sx(7r, ^(tt)), where 

2x({e,e'},0) := l(x„(e) = .Tb(e,))l(xt(e) = a;a(e')) ' 

2x({e,e'},l) := l(x„(e) = .T„(e,))l(a;b(e) = a;6(e'))l(^a(e) ^ ^Ke)) • 
We call a bridge n straight if '!9(7r) = and twisted if '!9(7r) — 1. See Figure 8.3. Clearly, we have 

Hg^e) - gx(e')) = 2x({e, e'}, 0) + Sx({e, e'}, 1) . (8.5) 

Thus, each untagged bridge may be split into a straight and a twisted one. We define 



An,^(x) 



n n n i(ex(e)^gx(e')) 



n^-(^'^w) 



.wen 



(8.6) 



^To avoid confusion we emphasize that these bridge tags have nothing to do with the edge tags of a decorated graph. The 
use of the same word is merely a symptom of a regrettable lack of imagination on the authors' part. 
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X g y 



X e y 



X e y 



y e X 



Figure 8.3: A straight bridge (left, drawn with a solid line) and a twisted bridge (right, drawn with a dotted line) 
joining the edges e and e'. In each case we indicate how the vertex labels of 2: = a(e) and y = 6(e) determine the 
vertex labels of a(e') and 6(e'). 



so that we have 



In this manner we may split 



where 



J2 An,^(x) = An(x). 

i?e{0,l}n 

y,(n)- ^ T4(n,^), 

i?e{o,i}" 



y,(n,7?) := ^An,^(x)Q,(x)[]E[]iJ,^,^,,,,^,. 

X Tren eGTT 



(8.7) 



8.2. Parallel and antiparallel bridges. In [1], the combinatorial complexity of a pairing was measured using 
the size of its skeleton pairing. The definition of the skeleton pairing relies on the following notion of parallel 
and antiparallel bridges. We say that tt, tt' are parallel if there exist i, j ^ {0, n} such that 

TT = {(*-l,z),(j,j + l)}, tt' = {(Z,* + I),(j-I,j)}. 

Similarly, 7r,7r' are antiparallel if there exist i,j ^ {0,n} such that 

TT = {(*-l,z),(j-l,j)}, tt' = {(j,z+l),(j,j + l)}. 

Note that the notion (anti)parallel is independent of the bridge tags. See Figure 8.4. A sequence of bridges 
TTi, . . . , TTj; is called an (anti)ladder if tt^ and tt^+i are (anti)parallel for alH = 1, . . . , fc — 1. 



i - 1 i i + 1 



i - 1 i i + 1 




J + 1 ^ 



J-1 




J + 1 ^ 



3-1 



Figure 8.4: Two parallel bridges (left) and two antiparallel bridges (right). 
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Next, we assign to each tagged pairing (H, d) a skeleton S{Il, i?) according to the fohowing rules. Every 
pair of parallel bridges that are both straight is replaced by a single straight bridge; every pair of antiparallel 
bridges that are both twisted is replaced by a single twisted bridge. (See [1], Section 7.2, for a precise 
definition of this collapsing of bridges. Each collapsing step removes one bridge - and hence two edges from 
/„ U /„' but always retains the vertices a(/„), &(/«), a(/„'), 6(/„').) We repeat this procedure until we reach 
a tagged pairing, denoted by 5(11, d), which contains no parallel straight bridges and no antiparallel twisted 
bridges. The resulting skeleton is independent of the order in which pairs of bridges are collapsed. We have 
that 5(n, d) e ^m,m' for some m ^ n and m' ^ n'. See Figure 8.5, and [1], Sections 7 and 9, for full details. 





(n,r) 



5(n,r) 



Figure 8.5: A tagged pairing along with its tagged skeleton. We draw straight bridges with solid lines and twisted 
bridges with dotted lines. 



8.3. The ladder. We now extract the leading order contribution to (8.2), the (complete) ladder. The ladder 
of degree n, denoted by L„, is the pairing given by 



„ = {{(0,l),(2n-l,0)},{(l,2),(2n-2,2n-l)},...,{(n-l,n),(n,n+l)}} 



^n — 



(8.9) 



see Figure 8.2. Set i^n = G {0, 1}^"; thus (L„,'(9„) is the ladder whose bridges are all straight. Since all 

I ^ I 2 

bridges of (i„,'!9„) are straight, we find that the expectation in (8.8) is equal to E|iifa;„(e)2:6(e) | • ^ow the 
argument of [1], Section 8, applies almost verbatim, and, together with Lemma 5.3, we get 



X n=0 ^ J J 



(8.10) 



for all <~p G Cf,(M''). In fact, the only needed modification to the argument of [1], Section 8, is that, in the 
proof of Lemma 8.4 of [1], the i.i.d. random variables {Bi) now have the law 



1 y/W^^\ 



M ^/w^ 






aeZ'' 



W J W 



(8.11) 



instead of 






aeZ'' 
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Here [•] denotes integer part and Sa the point mass at a. It is easy to see that the covariance matrix of the 
measure (8.11) is TE + o(l) as T4^ ^ oo, where, we recall, 

Ijij = / dx J [x) XiXj . 

8.4. Bound on the non-pair lumps. We now give a bound on the contribution of the higher-order lumpings, 
i.e. lumpings that contain lumps of size more than two. Wc start by assigning to each pairing 11 G ^n.n' its 
minimum skeleton size 

mill) := min (number of bridges in 5'(n, ??)) . (8-12) 

i?e{0,l}n^ 

The quantity m{lV) is the correct measure of the combinatorial complexity of the pairing IT. 
Let r G '^n,n' be an arbitrary lumping and define 

P(r) :- E(M-2)- (8-13) 

We say that a lumping V e '^n.n' is a refinement of a lumping F G '^n.n' if for every 7' G F' there is a 7 G F 
such that "/' d "f. If n G ^n,n' is a pairing that is a refinement of F, we say that 11 is a refining pairing of F. 

Lemma 8.2. For each F G ^n,n' \ ^ny there is a refining 'pairing 11 G <^n,n' o/F such that 

m(n) > max(^^,2j. (8.14) 

Proof. See Appendix C. D 

Next, we define the nonnegative quantity Va;(r) by taking the absolute value of all random variables in 
(8.3) inside the expectation, i.e. 

F,(F) = ^Ar(x)g,(x)nEni^-o(e)-Me,h (8-15) 

Clearly, ^ 

|i4(r)| ^ v;(F). 

Moreover, for a pairing 11 G ^n,n' we define the nonnegative quantity 

i?.(n) := M4^"m^Q^(^) -Q i(g^(e)^^^(e'))a^^^^^,^^^^, (8.16) 

X {e,e'}6n 

which is essentially similar to 14(11) except that we drop the condition that different lumps must have 
different label pairs. 

Wc may now bound the contribution of the higher order lumpings in terms of pairings. 



Lemma 8.3. We have that 



E E^-(r) ^ J2 E^-(n). (8.17) 



m(n)>2 
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Proof. We have the bound 



.7er 667 



M^P(r)^Q^(^)^^(^) 



.■yeT eE'r 



Xa(,)Xb(^) 



where in the first step we used that 

Let {n(r)}rg<s' , denote a choice of refining pairings satisfying (8.14). Then from Lemma 8.2 we get 



E E^-(r) ^ E E i(n(r) = n)^T4(r) 

^ ^ M^^^W ^ l(n(r)-n)^Q,(x)Ar(x) 



ne.f 



re«'„ 



.7eree7 



a;a(e)a:6(e) 



(8.18) 



where the sums over 11 are constrained by ?Ti(n) ^ 2. 

Next, we introduce a family ^p = {£'7}7(Er, where q-^ is an unordered pair of labels. Thus we may rewrite, 
for fixed x, 



Ar(x) 



.7GreG7 



Xa{e)Xn.) 



E 



n 1('?7 ^ (?7') 
■77^7' 



7eree7 



We now relax the condition n(r) = 11 in (8.18) to the condition that 11 is a refinement of F. We may then 
express F as F = T p using a partition P = {p\ of the set of bridges 11, where T p is defined as Vp = {7p}pep 
and 7p := Uwep ^' i-*^- ^ expresses which bridges of 11 need to be lumped to obtain F. 
Thus we get for F = Tp 



E 



n 1(^7 7^ ^7' 

7#7' 



n n i('?x(e) = (?7)<^a;„(,)x,(,) 

7er ee7 



E^-P(^n) n n l(£'x('^) ^ 9^)^x^f,^x, 



Me) > 



Sn 



ttGII ee-TT 



where we defined 



IpiQn) 






n n n i(^- ^ ^- ) 

.p^p' TVGp TC'Gp' 



The claim (8.17) now follows from the identity 

1 - E^^(^n), 
p 

and the fact that any lumping F G ^n.n' of which 11 is a refinement can be written as F = Tp for some 
partition P of the set of bridges H. D 
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8.5. Bounds on all lumpings. In this final subsection we show that the contribution to (8.2) of all non- 
pairings, as well as all tagged pairings different from the straight ladder of Subsection 8.3, vanishes as 
W —7- oo. For a pairing 11 € ^n,n' and tagging -d E {0, l}'^, we define T4(n, t?) in the obvious way (see 
(8.15), (8.6), and (8.7)). Clearly, we have that 

14 (n) = Y. ^-(n,^)- 

i5e{0,l}n 

For n, n' ^ we define 
and 

"•n,n' •" hn^n' ^ Onn'VxlLn^Vn) ^ hn^n' — Onn'yx\Ln,Vnj (8.20) 

is the contribution of all diagrams apart from the main term, the straight ladder, where we used that 
Vx{Ln,'&n) — VxiLmi^n). We remark that in [1] /i* „/ was denoted by hn^n'- 

Lemma 8.4. For any integer 1 ^ p ^ M^ we have 

as well as 

Y ^n^n' ^ C. (8.22) 

n+n' — 2p 

Proof. The proof of (8.21) is almost identical to the proof of Equation (7.10) in [1]. We bound general 
lumpings in terms of non-ladder pairings, whose contribution we estimate by analysing vertex orbits in 
skeleton graphs (see Sections 7.4 - 7.6 in [1]). 

More precisely, using Lemma 8.3 we see that the only needed modification to the argument of [1] arises 
from the additional factor M'*'^™'^'^^ in (8.16) compared to Equation (7.1) of [1]. Let m denote the number 
of bridges in the skeleton 5(n,-!?); then we have rh ^ miU) by the definition (8.12) of m(n). Thus we 
find that Equation (7.9) of [1] (in which F is now a tagged pairing not equal to a straight ladder) remains 
vafid provided that the factor M^/^M^^/^ is replaced with M^/^M^^^/^"'*'')™. Thus wc find from Equation 
(7.10) of [1] that, for 1 s^ p s^ M^, 

where we emphasize the additional factor of 2'' arising from the sum over all bridge tags of skeleton pairings, 
as described in Section 9 of [1]. The first term \/M accounts for the term p = 1 which consists of an 
antiladder with one rung whose contribution is trivially bounded by 1/M. As explained at the end of 
Section 7.5 in [1], the factor p^^/^ + M^^/^ results from a detailed heat kernel estimate (Lemma 7.5 in [1]) 
which follows from the band structure of H . If, instead of the band structure, we had imposed only the two 
conditions Y.y ^^ly = 1 and aly < M-\ then (8.23) would be vafid without the factor p-^/^ + M''^/^. 
Now (8.23) immediately yields 

Y /i;,n' sS CM^/2-1/3+85 ^ (8.24) 
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which is (8.21). 

Moreover, (8.22) foUows from (8.21) and the estimate 



J2vALn,K} ^ 1; 

X 

see Subsection 8.3. D 

From (4.5) and (8.21) we get 

J2 \an{W'^T)a^'{W''''T)\hl^, ^ ( ^ \a^{W'-T)\^\a^,{W'''^T)\A (^ E K"')') 

Vp^MA' \n+n'=2p / / 

S$ C(M^ M^-2/3+165^ 1/2 

= o(l) (8.25) 



a,sW -)-oo (see (5.7)). Then Proposition 8.1 foUows from (8.2), (8.10), (8.20) and (8.25). 



9. The boughs for k, < 1/5 

In this section we estimate the contribution of the boughs. It turns out that strengthening our assumption 
on K to K < 1/5 (from k < 1/3) greatly simphfies the estimate of the boughs. Thus, throughout this section 
we assume that k < 1/5. The next section is devoted to the case k < 1/3. 

In Section 8 we computed the contribution of the first term of (7.7); sec Proposition 8.1. We now focus 
our attention on the remaining three terms of (7.7), and show that their ^^-norm in x vanishes. We need to 
estimate 

E, -.^ ^ |a„(t)a„Ki)|E E E E \VAG^G',n\ (9.1) 

and 

E2 := J2 |anW«n'W|E E E \Vx{gmn',T)\ (9.2) 

n+n's^M'^ X Se®* re<^(Gu/„/ ) 

(It is easy to check that E2 estimates both terms on the second hne of (7.7) since H is Hermitian). 

Proposition 9.1. Choose fi and S so that 

K + Ad < n < 1/5-45. 

Then 

hm El = hm E2 — ■ 

VK— 7-co W~^oo 



35 



The rest of this section is devoted to the proof of Proposition 9.1. We expound our main argument for Ei. 
The estimate of E2 is very similar, and we shall describe the required minor modifications in Subsection 9.8. 
From (7.6) we get 



E,^Y^ |a„(t)a„'WlEE T. T. T. 

n+n'!^M'^ X see* e'e®*, re?#(GuG') x:r(x)=r 

7er ee-f v,wev(G\jG') 

where we abbreviated t = tguG' and I = IguG'- 

Next, we relax all nonbacktracking conditions in I pertaining to bough vertices. This gives 

n+n'^Mf see* e'e«', rGa'(GuG') x:r(x)=r 7er ee7 



(9.3) 



where 



(3(x) :- So^^^^^Sox^^^,^6a:^^^,^x 



HO) 



Lv,weV(S{G)) 



Yl l{d{v,w) = 2)l{x,^x^) 

.v,weV{S(G')) 



(9.4) 



implements the nonbacktracking condition on the stems S{G) and 5(G"). The estimate (9.3) follows from 



E^o 



^(G) °xxt(G) "a:2:„(G/) (JOx^G') 



J] {i-iiv,w)dx^x^) ^ g(x), 



D,tuGV(GUG') 



since, by definition of ©„ C 0(1, the stems S{G) and S(G') are completely nonbacktracking in / (i.e. l{v, w) = 1 
if d{v, w) — 2 and v,w E V{S{G) U S(G))). Note that Q depends only on the labels of stem vertices. 

9.1. Sketch of the argument. Before embarking on the estimate of Ei, we outline our strategy. We first fix 
the graph G ^ G' and the lumping T. We assume that G ^ G' y^ In Ul„/ for all n,n', i.e. we are not dealing 
with the bare stem. Starting from the bough leaves of GUG", we sum successively over all vertex labels that 
do not belong to the stem. The order of summation is such that we sum over the label of a bough vertex 
only after we have summed over the labels of all of its children. 

Our estimate uses two crucial facts. First, each leaf is a small edge (this is an immediate consequence 
of the growth process that generates boughs; see Proposition 6.6 (ii)). This means that, if a leaf is not 
lumped with any other edge, its contribution is small. Second, edges that are lumped together yield a small 
contribution owing to fixing of labels, which reduces the entropy factor associated with the summation of 
the labels. 

Any large bough edge yields a contribution bounded by 1, as follows from 



7 . IEP(b 0) (-ffx„(e)a;6(e) ' ^2:i,(.)a;„(e) 



a;i,(e) 



E 

a;b(e) 



/ . ''^|^2;„(e)a;i,(e) | 



s$ 1. 



(9.5) 
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Ideally, we would hope that each leaf, being a small edge, yield a factor of essentially M ^. For example, if 
T(e) — {h, 2), summation over the label of the final vertex of e yields 

ElE^-(e)(^-.(e.-Me,'^-M = ,-Mj = E IE|^..(e,.Kj ^ ^ ^^ . (9.6) 

In this case the order of a bough with I leaves would be M^' (up to an irrelevant factor M^^'). It is easy 
to see that a similar estimate holds for any leaf that is not lumped with another edge. This smallness fights 
against the combinatorics of the number of rooted, oriented trees with k edges and I leaves, which is of the 
order fc^'~^/Z^' (see (9.39) below). Thus we would find that the sum over the contributions of all rooted 
oriented trees with k edges is 

since k ^ M^ ^ M^/^. (The requirement Z ^ 1 is simply a statement that there is at least one bough edge.) 
It would then be a relatively straightforward matter to bound the contribution of all families of boughs 
growing from the stem, and to show that it vanishes asW^oo. 

Unfortunately, this simple approach breaks down because two leaves of type (5, 1) lumped together yield 
a contribution 

y y 

which is much larger than the desired factor M^'^. We emphasize that this problem only occurs when a 
lump consists solely of leaves of type {b, 1). Indeed, lumping leaves with tags (6,2), (6,3) or (6,4) yields a 
sufficiently high negative power of M to keep the simple power counting mentioned above valid. For example, 
if two leaves of type (6, 2) are lumped, their contribution is 



E^d^-^i')' < 



M 



In fact, it would suffice that every lump had a single edge whose tag is not (6, 1) to ensure that each leaf 
yield a factor 1/M. 

In this section, we develop a method that extracts a factor 1/vM from each leaf (or, more precisely, a 
factor 1/M from pairs of leaves) instead of the optimal factor 1/M, thus allowing us to reach time scales of 
order M^/^. In order to reach time scales of order M^/^, we need a decay of order 1/M from each leaf. This 
requires more effort and is done in Section 10. 

Notice that the estimates of the type (9.5)-(9.7) rely on £^-i?°°-bounds on the variances, ^ a'^y = 1 and 
maxy cr^ ^ M~^ for each x € A^y. In fact, all the estimates in Sections 9 and 10 rely on such power counting 
estimates. 

9.2. Ordering of edges and parametrization of lumpings. There are two natural structures governing the 
vertex labels in the bound (9.3): the tree graph G U G" and the lumping F. In the case of the bare stem 
(Section 8), we chose to sum over all vertex labels simultaneously, under the constraints imposed by F. This 
was possible because the tree graph /„U/„' of the bare stem was very simple. For a general tree graph GUG", 
however, this approach breaks down. Instead, we have to sum over the vertex labels in a manner dictated 
by the structure of the tree graph G U G', i.e. successively over each individual vertex label, starting from 
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the leaves. If all bough edges were in their own single-edge lumps, this strategy would be easy to implement. 
For a general lumping, however, we have additional constraints on the bough vertex labels arising from the 
lumping, which are completely nonlocal and in this sense conflicting with the constraints resulting from 
the tree graph structure GU G". We overcome this difficulty by introducing a special parametrization for 
lumpings (denoted by (F, A) h- > F below) that is suited to a successive summation along the bough branches. 
This parametrization is also needed for controlling the summation over all lumpings F. 

Let us fix n,n' as well as Q — (G,tg) G &'^ and Q' = {G',tg') G 25^ in the summation (9.3). We 
abbreviate £b '■= £{B(G) U B(G')) for the set of bough edges. Recall that a leaf is an edge e E £b such 
that 6(e) has degree one. We now introduce a total order ^ on the set of all edges £{G U G'). This order 
will govern the order of the summation of the vertex labels. We use the notation e ^ e' to mean e ^ e' and 
e 7^ e'. We impose the following conditions of ^. 

(i) If e and e' are both bough edges and e' is the parent of e (i.e. a(e) = b{e')) then e -< e'. 

(ii) We start the ordering from the leaves: If e is a leaf and e' is not a leaf then e ^ e'. 

(iii) Bough edges are smaller than stem edges: If e is a bough edge and e' a stem edge then e < e' . 

It is easy to see that such an order ^ exists. We choose one and consider it fixed in the sequel. Once ^ is 
given, each edge e € £{G U G') (except the last edge) has a successor, denoted by a{e) and defined as the 
smallest edge strictly greater than e. Note that the order < is not the same as the (partial) order induced 
by the directedness of the graph. Similarly, the concepts of successor and child arc unrelated. 

We shall sum over the vertex labels of the boughs, starting from the degree one vertices of the leaves. 
To this end, we need a parametrization of the lumping F e ^(G U G') that is suited for such a successive 
summation. The parametrization will be given by a map e i— > Ag on the set £b, and by F, defined as the 
restriction of F to the stem edges. The idea behind the construction of A is to set Ag to be the smallest edge 
in the lump containing e with the property that A,^ >- e; if there is no such edge, we set A^ = e. 

Definition 9.2. Denote by si[G U G') the set of mappings 

A:£b^£{GUG'), e ^ Ae, 

with the following two properties. First, ^e h e for all e. Second, if e' ,e" ~< e satisfy Ae' — Ae" — e then 
e' = e". 

The following definition will be used to reconstruct F from the pair (F, A). 

Definition 9.3. Let T be a lumping of the stem edges £{S{G) U S{G')), and A e £/{G U G'). Then we 
define F(F, A) as the finest equivalence relation on £{G U G') (denoted by ^) for which A^ ^ e for all e and 
e ^ e' whenever e and e' belong to the same lump o/F. 

Next, let u and u' denote the number of edges in S(G) and S{G') respectively. Note that u + u' is even. 
This is easy to see from the facts that stem edges have odd degree, bough edges have even degree, and the 
total degree n + n' = deg{Q U Q') is even. We have the following result which shows that any lumping F can 
be encoded using a lumping F of the stem and a map A e £/{G U G'). 

Lemma 9.4. For each F e ^(G U G') there is a pair (f , A) e ^„^„/ x s/{G U G') such that F = F(f , A). 
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Proof. Let F e ^guc be given. We define T to be the restriction of T to tlie set £3 — £{^{0) U S{G')), 
i.e. r = {7 n £s}-yer- Wc now claim that F e ^u,u' ■ Indeed, by definition of ^(G U G"), each 7 e F contains 
an even number of stem edges, which implies that the lumps of F are of even size. 

In order to define A, we assign to each bough edge e ^ £3 the smallest edge e' ;^ e, e' G £(GU G') in the 
same lump as e. If no such edge exists, we set A^ := e; otherwise we set A,. := e'. It is now immediate that 
F = F(F, A). In fact this is even a one-to-one map (a fact we shall not need however). D 



We now make use of Lemma 9.4 to sum labels Xy of bough vertices v in (9.3), starting from the leaves. 
Let us write 

Ex ^ Y, \an{t)an'{t)\ Y, Y. ^euc, (9-8) 



n+n'^Mf' 



where we defined 



re?#(GuG') x:r(x)=r 7er ee^ 



res 



x:r(x)=r(r,/i) 



7er(r,A) 



e£7 



The inequality follows from Lemma 9.4. Here u = |£(5(G))| and u' = \£{S{G'))\. Moreover, the summation 
over A is understood to mean summation over all A G s^{G U G'). 

Let us partition the vertex labels x into bough labels x^ and stem labels X5, i.e. 



{x, : i;e V(GUG')) - (xb,xs). 



where 



xs := (a;b(e) : e G £3) , xg := {x, : i; G V(5(G) U 5(G'))) 

Recall that (5(x) = Q(x5); see (9.4). Thus we get 



(9.9) 
(9.10) 



E, 



VuG 



' € 



E E Qi^s) 



v&s^^, xs:r(xs)=r 



Y.Y. 



A yiB Lee£ 



n l(^x(e) = eAAe)) 



7er(r,A) '^ef 



(9.11) 



Equation (9.11) is our starting point for estimating the contribution of the boughs. 

The roadmap for the following subsections is as follows. We start by fixing all summation variables in 
(9.11). In a first step, we sum over the bough labels x^ (Subsection 9.3). In a second step, we sum over 
the bough lumpings A (Subsection 9.4). The result of these summations is the bound (9.28) on Egug'- In 
a third step, we sum over all stem labels (i.e. xg and F) which yields a factor hu^u' (Subsection 9.5). In a 
fourth step, we plug the estimate (9.28) back into (9.8) and sum over the tagging r (Subsections 9.5 and 
9.6). Finally, we sum over the bough graphs G,G' (Subsection 9.7). 
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9.3. Sum over bough labels. In this subsection wc fix G,G', A as well as an order ^, and sum over x^ in 
(9.11). The following definitions will prove helpful. 

Definition 9.5. On Eb we define the inverse A^^ of A by setting A'^^ := e' if there exists a (necessarily 
unique) e' ^ e such that A^i — e; otherwise we set A^^ = e. Obviously, A'^^ :< e. We say that a bough edge 
e is lonely (with respect to A) if e = A^ = A~^. 

Note that e is lonely with respect to A if and only if e is the only edge in its lump of r(r, A) (this 
property is independent of F) . 

For now we assume that all nonlcaf bough edges have tag (b, 0); dealing with different nonlcaf bough tags 
is very easy and is done at the end of this subsection. Define the new tagging t = Tj^ through 

{T(e) if e is not a bough leaf 
(b, 2) if e is a lonely bough leaf 
(b, 5) if e is a nonlonely bough leaf, 

where we introduced the new bough tag (6, 5) whose associated polynomial (see table on page 17) reads 

The motivation behind this definition is the following. If e is a lonely leaf, its contribution to (9.11) can be 
bounded by 



EPr(e)(^a;„(,)a;,(,),i?a;,(,)X,(,)) ^ ^P{b.2){H^^^,^x,^,y H^^^^^^^^^^)\ ^ 



M' 



2S 



J^ 3:a{^)Xb{c) 



(9.12) 



as can be easily seen from Proposition 6.6 (ii) and Lemma 5.3. If e is a leaf that is not lonely, its contribution 
in the worst case is of the same order as if its tag were (b, 0). Here the worst case is given by T(e) = (6, 1). 
The best we can do is use the trivial bound 



\P{b,i)\Hxy, Hyx)\ ^ \P(^l,rj^[Hxy, Hyx) 



(9.13) 



for all i. From (9.12) and (9.13) we see that the smallness of a leaf of type (6, 1) is only useful if it is lonely; 
otherwise, its contribution is the same as if it were an edge of type (6, 0). For instance, we have 



EP^tA){H^y,Hy,)^ =E{\H,y\^-a 



12 2 n2 

'xyj 






^P{bfl){Hxy,Hy 



Now we claim that 



i(c)Xb(s-j ' Xb(c)X^i^^iJ 






(9.14) 



Indeed, this follows immediately from (9.12), (9.13), and the definition of f . In fact, the definition of r was 
chosen so as to satisfy (9.14). 

Next, we sum over the bough labels x^ in the formula, obtained from (9.11) and (9.14), 



E 



Yl l{g^{e) ^ g^{A,)) 



eeSE 



11 ii\PT{e)[Hx^^^iXb(,_),Hxt^^)Xa(,^)) 



(9.15) 
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by successively suniming up all labels in xs, in the order defined by :<. We denote the current summation 
edge hy e Cz £b (meaning that in the current step we sum over the label ^^(g)), and call e the running edge. 
When we tackle the edge e, we shall sum it out, by which we mean that we sum over the label xi,tg\ of the 
final vertex of e, and think of e as being struck from the graph G U G". Thus, if the running edge is e, then 
all edges e ^ e have already been summed out, and hence struck from G U G". In this manner we shall 
successively sum out all bough edges and strike them all from G U G'. 
For a running edge e € Sg define the subset of bough edges 



B^"^ := {eeSB ■■ ehe}. 



(9.16) 



The set B^"^' represents the bough edges that have not yet been summed out when e is the running edge. 
We also abbreviate 



r(5) 



If e is a bough edge, we define 



{Xb{e 



: e G 



B'^^'I), ^(«) := (Ae 



: e e 



b(-)) 



i?(^-) :. j: 



n i(^x(. 



gx(Ae)) 



11 111 'r(e) V^a(e)a;(,(e) I ^Xi(^^)Xa(„ 



(9.17) 



(9.18) 



If e is not a bough edge, we set R'^'^^ :— 1. 

Let eo be the first edge of £(GU G'). Moreover, (9.14) yields 



E 



n l(gx(e)-ex(^e)) 



eeSE 






s$ R 



(eo) 



We now proceed recursively, starting with e — cq, summing over Xt,tg\, then setting e to be the next edge (with 
respect to ^), summing over Xi,fg\, and so on until e is the first stem edge. In other words, we successively 
sum out all bough edges in the order specified by ^. At each step, we get a bound of the form 

R^"^ s^ e(e,A)i?('"('^», 

where £,{e,A) > is the factor resulting from the summation over Xh(g). Recall that a{e) is the successor 
(with respect to :<) of e. The following lemma gives an expression for ^{e,A). It also identifies the "bad 
leaves", i.e. the leaves whose contribution to the right-hand side of (9.14) is of order one, as the leaves e that 
satisfy A'^^ -< e = A^,. Our approach will eventually work because the number of bad leaves cannot be too 
large (see Lemma 9.7 below). 

Lemma 9.6. For each e <E £b we have the bound R'^'^^ ^ ^{e,A)R^'^^'^'>\ where 
ae,A) := 



2M^' 
M 

2M^ 
M 



l(j4e — e) if e is not a leaf 

l(Ag 1 ^ e = Ae)2M^^ if e is a leaf 



Proof. Assume first that e £ £b is not a leaf. Then we have T(e) = (6, 0) (recall that we assumed that all 
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nonleaf bough edges have tag (6, 0)). li Ag = e, we get 



R^'^ < J2 



X 



n l(ex(e)-gx(Ae)) 



11 ll\-^r{e)[j^x^^^iXt(,_y-Hxt(^iXa(^) 



/ . ^\P'r{e){Hx^^^■,x^^^^,Hx^^^^x^(Jl^J 



Xb(S 



< K>('^(e)) 

-^ -I L , 

where in the second step we used that -r(e) = (fo, 0), and consequently 



(9.19) 



to sum over the label x^gy 
If Ag >~ e, we get 



R^'^ < J2 



X 



n l(ex(e)-ex(Ae)) 



< 



11 llr T(e)(-n2:^(_^ja;^(^j,ii2:(,(^ja;^(_^j 

7er(r,A(-('?))) '^^'i' 

J2 l(^x(e) - ex(Ae)) ||P?(e)(i?x,(„x,(,),i?x,(„x„(,))L 
Xb(e) 



M 



.i?(-(e-)) ^ 



where in the last step we used the bound 

\\P(b,0)\Hxy, Hyx) 



< M^'aly ^ 



M 



2(5 



M 



(9.20) 



(9.21) 



as well as 

l(£'x(e) = £'x(^g)) ^ l(a;6(e) = a;fc(^_)) + l(xfc(g) = Xa(Ae)) • 
Note that the summation over x^jg) in (9.20) is restricted to the two values Xa{As) £^nd Xft(^_), which are 
fixed as they belong to x'^'^''^^). This concludes the proof of Lemma 9.6 in the case that e is not a leaf. 

Next, let e € i?B be a leaf. If Ag^ = e = Ag then e is lonely and T(e) = {b, 2). Thus we get, exactly as in 
(9.19) and using (9.12), 

M 
If ylg ^ ~< e = Ag then r(e) = (6, 5). Therefore, using 

||P(,,5)(i/.„i/,.)|L ^ 2m2^'< 

we get, as in (9.19), 

i?(^) < 2M2*i?('^('^)) . 

If e ^ Ag then r(e) = (6, 5) and we find, as in (9.20), 

M 



This concludes the proof. 



a 
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Putting everything together we get by iteration, for a fixed xg, 



E 



H l(ex(e) = gx(Ae)) 



cESe 






S^ F{A) nEnr-('^)(^-<.(e)-Me,'^-.(e,.„(e,) ' (9-22) 

7er '^e-)' 



where 



F(^) 



n 



eGf s nonlcaf 



2M 



2(5 



M 



l(Ae=e)) n f^+l(A-^^e-^e)2iV/ 



2<5 



eG^B leaf 



V M 



(9.23) 



So far we assumed that ah nonleaf bough tags were (6, 0). Now we deal with arbitrary taggings. We spht 
the tagging t = {tb,ts) into a bough and stem tagging, where 

TB ■■= (r(e) : e e £b) , rg := (r(e) : e e £{S(G) U 5(G'))) • 

We now define F{A,tb) in such a way that (9.22), with F{A) replaced by F{A,tb), holds for an arbitrary 
tagging r. 

Let e be a nonleaf bough edge. If T(e) — (6, i) for i > 0, Proposition 6.6 (iii) implies that i ^ 2. Therefore 

the bound 

M^^ - 



\P{h,i)(Hxy,F[, 



VX)\ ^ j^ \^xy\ 1 



valid for all i ^ 2, implies that each nonlcaf bough edge whose tag is not (6, 0) contributes an additional 
factor M^^^^^ to the right-hand side of (9.22) compared to if its tag were (6, 0). Thus we have that, for an 
arbitrary tagging r = (tb,ts), the estimate (9.22) is valid with F{A) replaced by 



FiA,TB) 



n 



/M2 



25 \ l(r{e)^{b.a)) 



.e^Ss nonlcaf 



V M 



f2M 



25 



V M 



+ 1{A, = e) 



n ( 



2M' 
V M 



2<5 



1{A-^ -<e^Ae)2M 



2(5 



e&£B leaf 

Thus we get from (9.11) 

re^„„, xs:r(xs)=r -4 ^gp ^€7 



(9.24) 



(9.25) 



9.4. Sum over bough lumpings. In this subsection we estimate ^^ F{A, tb)- Let £1 C £b denote the subset 
of bough leaves. Multiplying out the product over leaves in (9.24) yields 



F{A,tb) ^ 



n i 



J^2S\ l('^Ce)/(^,0)) /27\^2<5 



.eG^B nonlcaf 



V M 



M 



l(Ae = e] 



E 



ae{0,l}^' Lee^B leaf 






V M 



2(5 



(9.26) 
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Let L :~ \£i\ denote the number of bough leaves. We claim that the right-hand side of (9.26) vanishes unless 
\a\ :— X^e '^e ^ [^/2]j where [•] denotes integer part. This is an immediate consequence of the following 
Lemma. 

Lemma 9.7. The set of bad leaves C := {e (E £i : A'^^ -< e = A^} contains at most [L/2] elements. 

Proof. If e € £ then it follows from the definition of ^ that A'^^ E £i\ C. In words: A bad leaf always 
comes with a unique companion that is not bad. D 



Abbreviating ^„gro^i,£, ,„ 



e{0,l}^' :|aK[L/2] 



byEia 



\a\^lL/2] 



, we get from (9.26) 
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.e££B nonlcaf 



^2(5 \ l(T(e)#(''.0)) /27\f2(5 
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M 



l{Ae = e) 



(9.27) 
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|aK[L/2] 



eG^B leaf 
]^p5\ l(T(e)#(h,0)) 
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n 



.eG^B leaf 






2M 



2(5\ 



€ C2^ 



< C 



, n i"^) 

.eG^B nonleaf 
CM'^'^M^X^''^ -|-r 



2^X l(r(e)#(b,0))- 



1 



M 



c^Sb nonlcaf 



M 



2^25+p 
2S\ l(T(e)#(h,0)) 



M^" 



M J 



L-[L/2] 



(2M 



25\ [i/2] 



M 



where we used that 25 + 2^ < 1, and performed the sum over A trivially using the fact that, for each e, A^ 
takes values in a set of size at most M^. 
Summarizing, we get from (9.25) 



Egug' ^ C 



\ M 



L/2 



n 

e^Ss nonleaf 






2S\ l(r(e)#(6,0)) 



rew. 



E E Q(^s)niEnr-('^)(^-<>(e)-Me,,^xH.,..( 

xs:r(xs)=r 7er ''<^i 



(9.28) 



We can understand the first factor in (9.28) as follows. Each leaf carries a factor M~^ due to its smallness. 
We estimated the combinatorial factor arising from the sum over lumpings by M^ per leaf (which is near 
optimal in the case when most bough edges are leaves). Therefore, ideally, each leaf should contribute a 
factor M~^~^^ (up to an irrelevant M'^^). The above argument is only able to exploit this factor for half 
of the leaves; this is why we have the exponent L/2 instead of the desired L in (9.28). This deficiency is 
the main reason why the exponent of the time scale k is restricted to k < 1/5 in this section. If L/2 were 
replaced with L at this point, the whole argument of Section 9 would be valid up to time scales of order 
Ml/3. 
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9.5. Decoupling of the graphs and the tags. The summation in (9.8) over the decorated graphs G involves 
summing over G and tq under the constraint 

eef(G) 

and similarly for Q' . In order to sum over G and tq separately, it is convenient to decouple them. To this 
end, we define the degree of the boughs and the stem separately, 

deg(6(G),TB) :- Y. deg(e), deg{S{G),Ts) :- ^ deg(e) . 

ee£{B{G)) e(££{S(G)) 



As above, we use the variable u to denote \£(S{G))\. Moreover, we introduce the variable r — 1,2,3, .. . 
through 

deg(5(G),Ts) = u + 2r. 

That r is an integer follows from the fact that all stem edges have odd degree (since a stem edge has degree 
1 or 3). The variable r is equal to the number of small edges (i.e. edges of type (s, 1) which have degree 3) 
in the stem £{S{G)). The primed variables u', r' are defined similarly in terms of G' . 

Let us denote by 1{G) and /(C) the number of bough leaves in G and G" respectively. Now we may write, 
using first (9.8) and then (9.28), 

n—1 n' — 1 

El i^ Y |a„(i)a„'(i)l E E E E ^eus' 

X [l(|£(5(G))| - u)l{dcgiS{G),Ts) ^u + 2r)l(deg(S(G),TB) =n-u- 2r)] [primed 

n—1 n —1 



n+n'^M'' 



G-S'e&i M=o M'=o r,r'>o reg*^ „, xs : r(xs)=r 



yer e67 



[l(|£(5(G))| = w)l(deg(5(G), ts)^u + 2r)l(deg(S(G),TB) =n-u- 2r)] [primed 






l(G)+l(G') 



n 



/M2 



2S\ l(T(e)5^(b,0)) 



eG^B nonlcaf 



M 



(9.29) 



where [primed] means the preceding product of indicator functions with primed variables. The condition 
M < n is equivalent to requiring that Q y^Tn- 
Next, in (9.29) we bound 



rea'u.u' xs:r(xs)=r 7er ^£7 

This follows immediately from (8.19), (8.15), the bound 



< 
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2(5 \ '■+''' 



f^u-u' 



Prie) (-f^2:„(,)Xi,(,) , Hx^,^^)X^^^^ ) 



^ ll^.o(e,XM.)l ifT(e)===(s,0) 



Ml 
M 



l-^:i;„(e)^Me)l if T(e) = (s,l), 
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X 



and the fact that precisely r + r' stem edges have tag (s, 1). Thus we get 

/ ][/[25\ r+r' n-ln'-l 

El ^ J2 \<^n{t)an'it)\ J2 i^^) J2J2^^'"' 

n+n'^Mi^ r,r'>0 ^ ' u=0 u' =0 

Y, \}{\^{S{G))\ = w)l(deg(5(G),rs) = u + 2r)l(deg(H(G),TB) ^n-u- 2r)] [primed 

eGf B nonlcaf 

In the next lemma we show that we can replace the condition 

deg{B{G),TB)=n-u-2r with 2\£{B{G))\ ^ n- u -2r 

to obtain an upper bound. Thus we decouple the dependence of the indicator function on G from its 
dependence on the tagging tb- We do this by adding bough edges of type (6, 0) to G, and by ensuring that 
this procedure does not decrease the estimate of the graph contributing to (9.30). 

Lemma 9.8. We have that 

El ^ J2 \an{t)an'it)\ J2 [^^) EE^".«' 

n+n'!^Mi^ r,r'>0 ^ ^ tt=0 u'=0 



Y, [l{\£{S{G))\^u)l{deg{S{G),Ts)^u + 2r)l{2\£{B{G))\^n-u-2r) 



X 

g,g'e@„ 



primed 



"^(^i^j n [-w) • ^'-''^ 

eGfsnonlcaf 

Proof. Fix n, n', r, r', u, m'. Note first that 

2D := dcg(S(G),rB)-2|£(S(G))|, (9.32) 

is a nonnegative even number. It is nonnegative because every bough edge has degree at least two, and even 
because both terms of the right-hand side of (9.32) are even. 

Let Q = (G, tg) satisfy deg(S(G), tg) = n — u — 2r. We construct a tagged graph Q = (G, t^) as follows. 
If D = then we set Q = Q . If D > then we denote by v the stem vertex that is closest to a(G) such 
that V is the root of a bough. (Because D > Q there is such a. v.) We then define tj to be tj but with the 
vertex v replaced with a path consisting of D bough edges, each carrying the tag (5, 0). (More precisely, if e 
denotes the bough edge incident to w, we separate the vertices a(e) and v and join them with path of length 
D carrying tags (6, 0)). Thus, we simply lengthen a leaf by adding D additional large edges. 

We claim that Q has the following properties. 

(i) The map Q i-^ Q is injective. 
(ii) G and G have the same number of bough leaves. 
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(iii) 2\£{B{G))\ ^n-u~2r. 

(iv) The number of small nonleaf bough edges is the same in Q and Q, i.e. 



^25\ l(TG(e)#(6,0)) /M2■5^ l(^G(e)#(^0)) 



n =r = n 



ee£(e(G)) nonleaf ee£(e(G)) nonleaf 

(v) Q and tj have the same tagged stem. 

Properties (ii) - (v) are immediate from the definition of Q. Property (i) follows from the fact that Q can 
be reconstructed from Q as follows. Set Q^, ~ {G^,,tg,) '■= G- Let e be the first bough edge of G* reached 
along the walk (see Figure 6.7) around G*. If the total degree of the boughs oiQ^, is greater than 2\£{B{G))\, 
remove the edge e from Q^,. (Note that in this case Q^, ^ Q, and the edge e G £{B{G^,)) was added to G in the 
above construction.) Repeat this process until the total degree of the boughs of Q^, is equal to 2\£{B{G))\. 

Then g, = g. 

Constructing a tagged graph Q' in the same way from Q' , we bound the term indexed by Q, Q' on the 
right-hand side of (9.30) by the term corresponding to Q^ Q' . Using the fact that the map Q h^ Q is injective 
we may therefore bound the right-hand side of (9.30) by the right-hand side of (9.31), writing Q and Q' 
instead of Q and Q' . D 



9.6. Sum over taggings. Thanks to Lemma 9.8, we may perform the sums over G, G', tb, and ts separately 
in (9.31). We start with the sum over tb- From Lemma 9.8 we get 

/ ]\/pS\ r+r' n-ln'-l 
n+n'^Mi^ r,r'>0 ^ ' u=Ou'=0 



Y, E[l(l^('^(^))l = «)l(deg(5(G),Ts) =u + 2r)l{2\£{B{G))\ = n - u - 2r) 



X 

G,G'eW rs 



primed 



^^E(^^)"*^ n (m;i)"-'"*"'\ ,..3) 

Tb eG^B nonleaf 

The last line of (9.33) is bounded by 

HG)+l(G') ,,j, l(G) + l(G') 

c(^^)^GK-H'(-')(i + ^) ^g(^^^^^)^. (9.34) 

Next, we sum over the stem taggings ts in (9.33). The constraint deg(5(G),T5) — u + 2r means that the 
stem S{G) = lu has u — r edges with tag (s,0) and r edges with tag (s, 1). Thus we get in (9.33) 



^l(deg(5(G),rs) = « + 2r)l(deg(5(G'),r5) = «' + 2r') = (")(") s$ M'^'^^+^'K (9.35) 
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Plugging (9.34) and (9.35) into (9.33) yields 

E^ ^ c J2 i""W«n'(t)i E ^L^ EEv 



n+n'^JW r,r'^0 ^ ^ m=0 m'=0 



,, , i(G) + i(G') 



l{\£{S{G))\^u)l{2\£{B{G))\=n-u-2r)\[^TiuieA\r^^^ . (9.36) 

9.7. Sum over the bough graphs. We now sum over G^G' e 211 and complete the estimate of Ei. From 
(9.36) we get 

n—l n' — 1 

El ^ C E |a«(i)Q;„'(t)| E E ^"'"' "^"^"^"'■"' ' (9.37) 

where we defined 

Z„,„ := E^y^— E ^{\£{S{G))\^u)l{2\£{B{G))\=n-u-2r)i ^ . (9.38) 

The graph G has a stem S{G) — lu of size u, to which are attached boughs consisting together of 

n — u — 2r 



kn,u{r) = k{r) 

edges. Note that, because u < n, we always have fc(r) + r > 0. 

Next, let s ^ be the number of boughs in G. We order the s boughs of G in some arbitrary manner 
and index them using i — 1, . . . , s. Let fc, be the number of edges in the i-th bough, and li the number of 
leaves in the i-th bough. Denote by Sk^i the number of oriented, unlabelled, rooted trees with k edges and I 
leaves. Thus we get from (9.38), splitting the contributions s = and s ^ 1, 



^n,u ^ 



r>0 ^ ' 



i(fc(r) = o)+E , E E---En^^.^'^ 



s^l ^ ' ki+--+k^=k(r) li = l l^ = li=l ^ 



where we sum over fci ^ 1 for all i. The binomial factor accounts for the locations of the roots of the boughs, 
which may be located at any of the u + 1 stem vertices. 

The number Sk.i is known as the Naranya number. For the convenience of the reader, we outline its key 
properties in the following short combinatorial digression. For full details see e.g. [7], p. 237. Denote by X^j 
the set of sequences {wi, W2, . . • , W2fc) with k elements +1 and k elements —1, such that all partial sums are 
nonnegative and 

The set Xk^i parametrizes the set of oriented, unlabelled, rooted trees with k edges and I leaves. This 
identification is the well-known bijection between such trees and Dick paths. It is constructed by walking 
around the tree, as in Figure 6.7, whereby at each step we add the element +1 to the sequence if we move 
away from the root and the element —1 if we move towards the root. See e.g. [3], Chapter 1, for further 
details. Thus we have Sk.i = \Xk.i\. In [7], p. 237, it is proved that 



Having found the expression (9.39) for Sk,i, we may continue our estimate of Zn,u- We get 



Zn,u ^ 7 ^ 
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s^l ^ ^ fciH hfes = fe(r) /i = l /^ = li=l 

M+1' 
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s t-tu^H^^) 



s>l ^ ^ fciH hfc^ = fe(r) i=l * /i=l ^ 



M 



^fM'^^M^'V ^ fu+l 
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where we used that ki ^ M^, /i + 4(5 < 1/5, and the fact that r ^ 1 ii k{r) = 0. Thus we get 
CM^^Mt' .^/M^^M'^y.^ /u+l\ /k(r)-l\ f CM'^^M^'^'''^ 

r>0 ^ ' s>l ^ 
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From (9.37) and (9.40) we may finahy conclude 

n— 1 n' — 1 



n— 1 n — 1 ^ 

^1 < o(l) ^ |a„(i)a„,(t)|^^/^„,„,-— ^ 0(1) ^ |a„(t)a„.(t)| 



tt=0 «'=0 



n+n'^MA' 



M^ 
M^ 



(9.40) 



o(l), (9.41) 



where we used (8.22), Cauchy-Schwarz, and (4.5). 

9.8. Bound on E2. In this final subsection, we show that E2 vanishes as W —>■ 00. Recall from (9.2) that 

E2 = Y. \o'n{t)an,{t)\Y, E E \VAGiiIn',r)\ 

Now the preceding discussion, after setting Q' ~ Z„/ and u' = n' carries over verbatim. The analogue of 
(9.41) yields 

E2 ^ 0(1) Y. \»n{t)a,At)\YK.^.— ^ «(1) E]\^^KWI E I""'WIa^^'^«,«' • 

n+n'^Mi^ u=0 \n=0 / \tt,n'=0 / 

The first parenthesis is bounded by a constant (using Cauchy-Schwarz and (4.5)). Wc bound the second 
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parenthesis using Lemma 8.4 and (4.5): 



M'' M'^ Mi^ 



u.n'—Q 
fu/2 Z^ 2^ «."' 



u=0 



p^M'' u+n'=2p 






This completes the proof of Proposition 9.1. 



10. The boughs for k, < 1/3 

In this section we extend the resuh of Section 9 (i.e. Proposition 9.1) from k < 1/5 to k < 1/3. The goal of 
this section is to prove the following result. 

Proposition 10.1. Choose n and 6 so that 

K + iS < M < 1/3-8(5. 

Then 

lim El — lim i?2 = , 

where E\ and E2 are defined in (9.1) and (9.2) respectively. 

10.1. Sketch of the argument. In Section 9 we estimated the contribution of the boughs by summing 
successively, starting from the leaves, over the label of the final vertex Xh(g) of each bough edge e. We called 
this process summing out the running edge e and interpreted it as striking e from the graph G U G". This 
summation was done for a fixed lumping which induces constraints on the values of the labels. In particular, 
we used the simple fact that, if the running edge e is lumped with another edge that has not yet been 
summed out, then the label of final vertex a;f,(e) of e is fixed. This reduces the entropy factor associated 
with the summation over a;f,(e) from ikf to 2. In general, bigger lumps typically have smaller contributions 
and this effect counterbalances the fact that the combinatorics of the lumpings consisting of bigger lumps is 
larger. If, on the other hand, a leaf e is not lumped with any other edge (and therefore its end-label xi,rg\ 
can be summed up without restriction), then the factor resulting from summing out e is small; see (9.12). 

It turns out that the summations over all bough labels and bough lumpings (i.e. over x^ and A in the 
notation of Section 9) are not critical on the time scales we are concerned with, t ~ M'^ where k < 1/3. 
Hence these summations we can be done generously (see (9.26) where the main contribution comes from 
\a\ — [L/2\). The main reason for the restriction k < 1/3 is that the exponent k = 1/3 is critical when 
estimating the summation over all stem lumpings; see [1], Section 11. With the method presented in this 
section, k < 1/3 is also critical for the summation over the bough graphs (i.e. over G U G'). 

As outlined in Subsection 9.1, the key difficulty when estimating the contribution of the boughs is to 
extract a sufficiently high negative power of M from the summing out of each bough leaf. This power is 
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needed to control the combinatorics resulting from summing over all bough graphs. Ideally, each bough leaf 
should give a factor M^^ (up to factors of M^), but in Section 9 wc saw that this is not true for leaves of 
type {b, 1). Accordingly, we were only able to extract a factor M^^/^ from each bough leaf; see Lemma 9.7 
and (9.28). More precisely, the only obstacle to extracting the full factor M^^ from every leaf, and thus 
reaching time scales of order M^/"^, was lumps consisting exclusively of leaves of type (6, 1); see (9.7). 

In this section we overcome this obstacle by exploiting the fact that, if the running edge e is a leaf that 
is lumped with another edge that has not been summed out, then both of its vertex labels, Xaig) and Xf,(e), 
are fixed. In order to make use of the reduction of the entropy factor resulting from the fixing of ^^(e); we 
need to sum over both Xf,(e) and Xatg) when our algorithm tackles the leaf e. The sum over Xaig) corresponds 
to summing out the parent edge of e. This additional summation over Xa{e) is clearly not possible for every 
leaf since several leaves may have a common parent or the parent of the leaf may be on the stem whose 
labels are summed over separately. Thus the simultaneous summation over both labels of a leaf can only 
be applied once for each group of adjacent leaves (namely, to the free leaf of the group; see below), and 
is not applicable at all for leaves incident to the stem (called degenerate leaves; see below). However, this 
deficiency is counteracted by the fact that the number of boughs with large groups of adjacent leaves, as 
well as many leaves incident to the stem, is considerably smaller than the number of arbitrary boughs (see 
Lemma 10.8). This gain in the graph combinatorics is sufficient to compensate for the large contribution of 
groups of adjacent leaves and of leaves incident to the stem. 

Roughly speaking, we gain a factor M~^/^ from summing out each degenerate leaf, essentially as in 
Section 9. Additionally, with the double summation procedure for the free leaves, we gain the optimal factor 
M~^ from summing out a free leaf together with its parent. Actually, we get the somewhat larger factor 
^-i+M+55^ where the additional Af represents the entropy factor from summing over bough lumpings A 
as in Section 9. (Recall that the combinatorics of the bough lumping, encoded in the function e h- > ylg, 
is overestimated by allowing Ae to by any of the M^ edges.) These gains have to be compared with the 
combinatorics of the graphs. The number of bough graphs with a given number of free and degenerate leaves 
can be easily estimated; this (with a slightly different parametrization) is the content of Lemma 10.8 below. 
Then it would be a fairly straightforward enumeration to sum up the contribution of all boughs; this will 
eventually be done in the second part of Subsection 10.7. 

Unfortunately, this simple-minded procedure is substantially complicated by a technical hurdle. In Sec- 
tion 9 the graph structure of the boughs and a simple ordering of the lumps determined a natural order 
of summation over the bough edges in such a way that the necessary size factor could be extracted from 
each edge at the time it was summed out. This idea was implemented by recursive relations of the type 
i?'^'^-' ^ ^R^'^^'^^^ in Section 9, where recall that a{e) denotes the successor of e. In the current situation, 
we have to extract a factor M~^ from each free leaf. If a free leaf e is bad (i.e. it is lumped with an edge 
preceding it in the order ^ but with no edge following it, written Ag -< e — A^; see Lemma 9.7), then the 
simple-minded approach of Section 9 yields a factor of order 1 from summing out e. (In fact, a key step in 
Section 9 was to bound the number of such bad leaves.) 

The solution is to reallocate dynamically, along the summation procedure, the weight factors from the 
running edge to edges that will be summed out at a later stage. In other words we make sure that, if e is a 
leaf, when summing out the edge e := Ag -< e we transfer a part of the smallness resulting from summing 
out e to the leaf e. If e itself is not a free leaf this is easy, because we can afford to transfer all of the 
smallness resulting from summing out e (i.e. M^^) to e. If e itself is a free leaf then this approach does 
not work, because the combined summing out of e and e yields a smallness factor M^^, which is not small 
enough to be shared among two free leaves. We solve this problem by summing out e and a{e) in one step, 
as explained above. By choosing the order ^ appropriately, we shall ensure that the successor cr(e) of any 
free leaf e is its parent (i.e. a^ — 6<j(e))i so this double summation amounts to summing up the labels of both 
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vertices of e at the same time. This yields a total smallncss factor Ad ^, half of which is used to sum out e 
(and hence get a small contribution), and the other half transferred to e. 

Thus, when summing out a free leaf e, we always also sum out its successor (7(e). In practice, we need 
to consider all possible cases for Ag and A^(^^-^, but only the cases where Ae ^ {e,cr(e)} are interesting (since 
otherwise A^ is cannot be a leaf larger than e). The various cases are summarized in Proposition 10.5 (iii) 
(note that in the notation of Proposition 10.5 the running edge is e, which was denoted by e in the above 
discussion.). 

In the preceding paragraphs we neglected the role of the entropy factor M^ arising from the summation 
over the bough lumpings A. The reallocation of weights (implemented in Proposition 10.5) is designed 
in a way that ensures that every free leaf yields a small contribution of order M~^+^ (up to irrelevant 
M^ ' factors) after the summation over bough lumpings. Thus, we not only shift weights arising from the 
summation over labels, but also entropy factors associated with summing over lumpings. More precisely, if 
e ~< Ae and both e and Ae are leaves, then we shall shift a factor M~^~^'^^^^ from e to Ae (the tiny power 3S 
is unimportant, and needed only to compensate the various powers of M that arise in our estimates). We 
have seen above that a factor M~^ is available for transfer irrespective whether e is bound (its total gain can 
be transferred) or e is free (its total gain, M~^, can be shared between e and Ae). To see why transferring 
the entropy factor ikf^ is necessary, consider for instance the case where e is a bound leaf and Ae is a leaf. 
After the smallness M~^ has been transferred from e to Ae, we shall have to sum over Ae, which yields 
an entropy factor M^. To ensure that the contribution of e after the transfer and the summation over Ae 
is not 0{M'^) but 0(1), we transfer only a factor M^^+^+'^*^'') instead of M~^ from e to Ae- In this way, 
sufficient smallness (i.e. a factor M~^) remains with e to compensate the entropy factor M^ associated with 
the summation over Ae- 

Summarizing, we transfer M^^+^+'^*^ ■* from e to Ae, thus moving the combined contribution (weight 
times entropy) from e, where it was obtained, to Ae, where it is used. This transfer makes the iterative 
argument cleaner; it provides a simple way of making sure that every free leaf yields a factor M"^"*"^"*" '^ ^ 
Without this procedure the total weight of the leaves would be the same, but we would need a more 
complicated bookkeeping of the small factors M^^+^+'^('') to ensure that they arise precisely as often as 
free leaves. For instance, if e is a bound leaf and Ae a free leaf, the contribution of e would be M"^"'"^"'"'^^''^ 
(summing out e and summing up for the possible lumpings Ae) and the contribution of Ae would only be 
O(M^). 

We shall bookkecp the weights using tags as before, but now we shall work with laggings t^'^' depending 
on the running edge e that will express this reallocation process. We shall also introduce a new tag, (6, 6), to 
record the smallness needed from each lonely leaf; it bears the weight that we shift around, i.e. M^^+^+'^'^M^'^; 
see (10.2). Using it, we may transfer smallness from the running edge to another leaf that has become lonely 
only after all other edges in its lump have been summed out. In other words, the concept of loneliness, and 
the smallness factor associated with it, becomes dynamical (see Definition 10.4). The goal is to organize the 
summation over the bough labels in such a way that one or at most two edges are summed out in one step 
and, as before, recursive relations of the type i?^^) ^ ^i?^'^^'^'' or i?^'^^ ^ ^i?*^*^ ('^^^ keep track of the result. 
The running quantity W- \ which expresses the size of the terms not yet summed out, will depend on the 
dynamical tagging, t*-^'. Much of the heavy notation of the following subsections is due to the meticulous 
bookkeeping of this dynamical process. 

10.2. Classification and ordering of leaves. As in Section 9, we first concentrate on the term Ei. Our starting 
point for the proof arc the bounds (9.8) and (9.11), where we split the vertex labels according to (9.9) and 
(9.10). 

Let all summation variables in (9.11) be fixed. We begin by classifying all leaves in £b = £{B(G)UB{G')). 
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To this end, we recall that B{G) U B{G') = IJ^ Ti consists of disjoint boughs (rooted oriented trees) Ti whose 
roots are distinct stem vertices. If all edges of a bough Ti are leaves, we call Ti degenerate. Otherwise we 
call Ti nondegenerate. Thus, all edges of a degenerate bough are incident to its root. We call the edges of a 
(non)degenerate bough (non) degenerate edges. 

Next, we assign each leaf of £b to one of three categories: degenerate, free, or bound. See Figure 10.1. 



Figure 10.1: A graph G with five boughs, two of which are degenerate. We draw one possible choice of free leaves. 
Free leaves are drawn with solid lines, bound leaves with dotted hues, and degenerate leaves with dashed lines. 

Definition 10.2. A leaf is degenerate if it belongs to a degenerate bough. For each nondegenerate bough T, 
we choose a maximal subset of leaves Ct C £{T) with the properties that 

(i) no leaf in Ct is incident to the root of T , 

(ii) no two leaves of Ct are adjacent. 

We call the leaves in Ct free, and the remaining leaves of £(T) bound. 

Note that, by definition, each bound leaf is adjacent to a free leaf. Thus, in a nondegenerate bough, each 
group of adjacent leaves contains precisely one free leaf. 

Next, we introduce a total order ^ on£(GUG'), as in Section 9, which will dictate the order of summation 
over the labels of the bough vertices. As in Section 9, we denote by cr(e) the immediate successor of e with 
respect to ^ (provided that e is not the last edge oi £{GU G')). We impose the following conditions of ^. 

(i) If e and e' are both bough edges and e' is the parent of e then e < e' . 

(ii) A free leaf immediately precedes its parent: If e is a free leaf then cr(e) is the parent of e. 

(iii) Nondegenerate boughs precede degenerate boughs which precede the stem: If e belongs to a nonde- 
generate bough, e' to a degenerate bough, and e" to the stem, then e < e' < e" . 

These properties encode the plan that children will be summed up before their parents (as in Section 9) and, 
additionally, in case of the free leaves, their parents will be summed up immediately after them. Furthermore, 
nondegenerate edges will be summed up before the degenerate ones. 

It is easy to see that an order satisfying (i)-(iii) exists^ on £(GU G"). We choose one such order and 
consider it fixed in the sequel. 



^Such an order can for instance be constructed as follows, by successively removing edges from E{G U G'). Pick any 
nondegenerate bough T (if one exists) and remove all of its bound leaves in an arbitrary order. Then remove from T either a 
free leaf followed by its parent or a nonleaf edge, in such a way as to keep the resulting tree connected (in other words, respect 
the condition (i) above). When all edges of T have been thus removed, repeat this procedure on another nondegenerate bough. 
When all nondegenerate boughs have been thus removed, remove all degenerate leaves in an arbitrary order. Finally, remove 
all stem edges in an arbitrary order. 
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As in Section 9, we parametrize a general lumping F e ^(GU G") with a pair (F, A) G ^m,m' x ^(GU G"), 
where u and u' denote the number of edges in £{S{G)) and £{S{G')) respectively. See Definitions 9.2 and 
9.3, as well as Lemma 9.4. 

10.3. Sum over nondegenerate bough labels. We start by summing over the bough labels x^ in (9.fl), 
which we recall here for convenience: 



S, 



sue' 



< 



E E Qi^s) 



re«'„ 



xs:r(xs)=r 



EE 



n l(^x(e) = g^A,)) 



xb Lee££ 



7er(r,A) 



ee7 



(10.1) 



In this subsection we sum over the vertex labels of nondegenerate boughs. 



Definition 10.3. Denote by eg the first (with respect to :<) edge of £{G U G'), by e^ the first edge of the 
degenerate boughs, and by e^ the first stem edge. If there are no degenerate boughs, set e^ — es- 

Note that, by definition of ^, we have eo ^ e^ ^ Cs (where equality is possible). 

As in Section 9, we sum over the bough labels successively using a running edge e. We start with e — cq, 
the first edge in £{G U G'), and after each step redefine e to be (T{e); we stop when e = e^. We use the 
abbreviations B^'^\ x'^'^\ and A^'^^ from Section 9; see (9.16) and (9.17). 

We shall need one additional bough tag, (6, 6). Its associated polynomial is defined by (we also recall the 
definition of Pa 5) from Section 9) 



P(b,5){Hxy, Hyx) 



2M^^a^ 



xy : 



P{bfi){Hxy,Hy 



2M- 1+^+5 Vi, . 



(10.2) 



As in Section 9, we consider first the special case that all nonleaf bough edges are large, i.e. have tag (6, 0). 

We recall that a bough edge e € £3 is lonely whenever A^ — e and there is no e' ^ e satisfying A^i = e. 

This is equivalent to saying that e is the only edge in its lump of r(r, yl). We define a new tagging t through 

T(e) if e is not a bough leaf 
^{^) '■— \ (b, 6) if e is a lonely bough leaf 
(6, 5) if e is a nonlonely leaf. 

Note that this tagging t is different from the one used in Section 9. The role of the new tag (b, 6) is to 
encode the gain from a lonely leaf, similarly to (9.12), but the estimate will be somewhat weaker. 
Using Proposition 6.6 (ii) and Lemma 5.3 it is now easy to see that in (10.1) we have the bound 



-fer{f,A) e67 



i(c)Xb(e-j ' ^Xt(e)^ 






(e)2:6(e) 1 -"Xb(e)a 



(10.3) 



At this point the definition (10.2) deserves a comment. It seems that the bound (10.3) with the definition 
of P{b,6) is wasteful; indeed, (10.3) would be correct even if on the right-hand side of the second equation of 
(10.2) we replaced M~^~^'^~^^^ with M~^~^^^ . This difference is of no consequence for our estimates, however, 
as the critical contribution to (10.3) comes not from lonely leaves, but from leaves lumped with other leaves, 
as explained around (9.7) and (9.12). Hence the wasteful additional factor M^^^* is of no consequence. The 
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factor M^^+^+''' is designed with the transferring of entropy factors M^ in mind, as explained in Section 
10.1. This turns out to be the correct choice for the algorithm contained in Proposition 10.5 below. 
Next, we introduce a generalization of the notion of loneliness that is relative to the running edge. 

Definition 10.4. Let e G £b be the running edge. We say that an edge e E £b is lonely in A^"^' whenever 
Ae = e and there is no e' < e satisfying e' € B^'^' and Ae> = e. 

Clearly, e is lonely if and only if e is lonely in A'-'^°'> = A (since eg is the first edge oi £b)- 
In order to get an adequate estimate from the successive summation over the nondegenerate bough labels, 
we shall need that, at each step indexed by e of the recursion, every leaf that is lonely in A^'^'' is small in the 
sense that it has tag (6,6). Thus, we shall have to dynamically modify the bough tagging. To this end, let 



-(e) 



(r(^-)(e 



eye] 



denote a tagging on the set of edges {e e £{GU G') : e )^ e}. The tag T(^^(e) will indicate the actual weight 
of the edge e after summing out all edges before e, i.e. it will take into account the reallocation of the weights. 
In analogy to (9.18), we define 



i?(5)(^(e-)) .= J2 



n l(ex(e)=ex(^e)) 



n Eni^r(=-)(e)(^x.(.,.M.,,i?x,,.,.„<.,)|. (10.4) 



If e is not a bough edge, we set R^'^^t'-'^'') :— 1. 

We define the initial tagging through t^'^°'> := r. Now (10.3) immediately implies that in (10.1) we may 
bound 



E 



n l(ex(e) = gx(Ae)) 



n 

7er(r,A) 



E ]_ _[ Prie) (-ffa:„(.)a;6(,) , H, 



Xb(^)Xal^)/ 



e£7 



< 



7^(eo)(^(eo)^ 



(10.5) 



We shall construct a sequence of taggings t^'^o\t'-'^^'^°''\ t^"^ (eo))^ . . . that satisfies the following property at 
each step e. 

(Lg) If e e Bf*^) is a leaf then r(«)(e) e {(5, 5), {b, 6)}. If in addition e is lonely in A(«) then r^'^^e) ^ {b, 6). 

To start the iteration in e, we note that by definition of r, the initial property (Le^) holds. 

The next proposition allows us to sum out a nondegenerate bough edge e, i.e. perform one step of the 
iteration. Before stating it, we give its main idea. If e is a nonleaf edge or a bound leaf, we sum over the 
label XMg) in (10.4). The resulting bound is given by a number ^ resulting from the summation over xug) 
multiplied with i?''^^'^^^. We use the fact that, by (Lg), a lonely leaf in A^*^^ is small in the sense that it 
carries a factor M^^^'^^^^ . In order to be able to iterate this procedure, we need that (Lcr(g)) be satisfied. 
This is not true if Ag is a bough leaf that is lonely in A^'^^'^^\ We remedy this by introducing a factor 
1 = jv^i-M-3<5 j^^-i+Ai+as^ j^jj(j absorbing the first term into ^ and the second into R('^^^^\ This wiU be 
implemented by changing the tag T'^'^^(Ag) = (6,5) to T^'^^'^^\Ag) — (6,6). 

In the case that e is a free leaf, we have to extract a factor M^^^'^^^^ from the summation over Xf,(g). 
This is relatively easy if e is lonely in A^^'; otherwise we need to distinguish further cases. The most involved 
case, in particular, occurs when Ag y e and Ag is a leaf that is lonely in A^'^^''^) (i.e. the lump of e in 
A^"^' consists only of two elements, e and Ag). In this case we need to change the tag of Ag in t^'^'-^'I^ to 
(6, 6), as described above. The resulting factor ^ is of order M~^~^'^^ M^~'^~^^ , which is much too large (here 
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jy^-i+2S (;.Q]-Qgs fj-om (9.12) exactly as in Lemma 9.6, while M^^^^^^ comes from the reallocation of this 
factor into ^ explained in the previous paragraph). We remedy this by exploiting the fact that the label of 
a(e) is also fixed by the lumping. Thus, we perform two summations in one step: over Xft/g-) as well as over 
Xa(e) = Xb((T(e))- I^ othcr words, we sum out both e and its successor cr{e) at the same time. Together these 
summations yield a factor of order M^^+^+^'*, which is small enough. 

We shall need to distinguish several different cases, which leads to a somewhat lengthy statement of the 
iteration step. The reason is that the worst-case estimates, which arise in the case Ag = e (or, if e is a 
free leaf, in the cases where we do not have Ae ^ {e,a{e)} and Ao-(g) >- o-(e)) are not good enough for the 
following step of summing over lumpings (done in Subsection 10.5). In the case Ag = e, we shall need to 
compensate the poor estimate by the smallness of the entropy factor associated with the summation over Ag 
(namely 1). In the case Ag >- e, this same entropy factor is much larger (of the order M^), but the estimate 
on ^ is sufficiently strong to compensate for this. The following Proposition collects the estimates for the 
various cases. 

Proposition 10.5. Assume that e G £b is a nondegenerate bough edge and that (Lg) holds. 

(i) If e is not a leaf, then there exists a tagging t^°'^'^>> satisfying (L„fg\) as well as the estimate 

i?(g)(^(s)) ^ ^i?('^('^))(r('^('^))), 

where 

{1 ifAg = e 

2^-1+25 ifAg)^e and Ag is not a leaf (10.6) 

2M-f'-^ if Ag)~ e and Ag is a leaf. 

(ii) If e is a hound leaf, then there exists a tagging t^'^(^'> satisfying (Lcr(g)) as well as the estimate 

Ri^)[T'^-^)) <^ ^i?('^('^))(T('^('^»), 

where 

^ := 2M-''-^ . (10.7) 

(Hi) If e is a free leaf, then there exists a tagging t*-*^ ''^" satisfying (Lg-a/g)) as well as the estimate 



where 






f Ag e {e, (T(e)} and ^^.(g) = (j{e) 

f Ag e {e, cr(e)} and A^(g) >- (j{e) (10.8) 

/Ag^{e,cr(e)}. 



The form of (10.8) is crucial for the later summation over the lumpings Ag and A^i^gy The entropy factor 
from each such summation is 0(1) if we have a "hard constraint" (i.e. that constrains Ag (or ^^.(g)) to one 
or two edges), and M^ if we have no hard constraint. Thus, summing over the lumpings Ag and ^(j(g) yields 
an entropy factor M^^"^^ \ where z = 0, 1, 2 is the number of hard constraints. It is easy to see from (10.8) 
that ^M^(2-j) is always bounded by M-^+^'+^^^\ 



56 



Proof of Proposition 10.5. In order to avoid needless special cases throughout the proof, we shall always 
assume that Ag and ^^(g) are leaves, unless otherwise stated. This assumption always covers the worst case 
scenario. 

We begin with Case (i). The cases Ag — e and Ag >- e, Ag not a leaf, arc dealt with exactly as in the 
proof of Lemma 9.6; see (9.19) and (9.20). In both cases we set T'^''^'^^^{e) := T'^{e) for all e >: (T{e). 

li Ag y e IS a. leaf we get from (9.20) 

where t^'^*^'^'^ is defined as 

1^(6,6) lie^Ag, 

i.e. the gain of size M^^^ from the summation over xi,(g) is not exploited immediately in £_, but a part of 
size M^^+^+'^'' is reallocated to the tag of Ag. Here we used the bound 



M +^+ \P(^,,;^){Hxy,Hyx)\ ^ \P(^t,Q-j{Hxy,H, 



V^J\ ' 



which we tacitly make use of in the rest of the proof. Note that the second line of (10.9) guarantees that 
(L„(g)) holds. 

Next, we consider Case (ii). If Ag ~ e, i.e. e is lonely in A^'^\ then we use (2.7) and the fact that the 
property (Lg) implies T^'^^(e) — (&, 6). Hence, by (10.2), we get 

where we set T'^''^'^^^(e) :— T'^'^'(e) for e >: (j(e). Ii Ag y e we define t^°'^'^'>'> through (10.9) and get, as in the 
proof of Case (i), 

Ilis)(^T^i)) ^ 2M"^"''i?('^('^))(r('^('^^^). 

Again, one can easily check that {L^tg\) holds. 

Now consider Case (iii). By property (ii) of the order -<, we know that a{e) is the parent of e, i.e. 
b{a{e)) = a{e). Note that in this case we sum out the two edges e and a{e) in one step. 

Consider first the case Ag — e and Acr(g) — cr(e). Then T^'^^{e) — (6,6) and T*^'^^(cr(e)) = (6,0) by 
assumption. Therefore summing over xi,tg-\ and x^tg-j = xi,t^tg\\ using (10.2) and (2.7) yields 

where t'-'^ '^'^"(e) := T^'^''{e). In the case Ag = Ao.(g) = cr(e) we have that T'-'^-'(e) is either (6,5) or (6,6), and 
T('^)(cr(e)) ^ (6,0). Thus (10.2) and (2.7) imply 

where t'^'^ ''^^^(e) := T^'^\e). It is easy to see that (LCT2(g)) holds. We have covered the first line of (10.8). 

Next, consider the case Ag = e and Ao.(g) >- cr(e). Then, using that e is lonely in A'^ (and hence carries a 
tag (6,6) by (Lg)) and that ^^(^(e)) is fixed, we get the bound 
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where we set 



thus, (Lo.2(g)) holds. Similarly, if A^ = (7{e) and A„/g\ >~ (j{e) then xi,/g-\ and a;t,(CT(e)) £^re fixed by Ag and we 
get 

where we define t'*^ ('^^^ through (10.10). This covers the second line of (fO.8). 

We now turn to the last line of (10.8). Consider the case Aj ^ {e,a{e)} and j4o-(e) = o'(e)- Thus Xft(g) 
and a:(,(cr(e)) ^re fixed by Ag, and we have 

R^'^T^^'') ^ 2M-i+2'5M-i+2'5i?('"'(^»(T('^)) = 2M-i-^+'^Af-i+^+3'5i?('''(""»(T(^)) 

^ 2M^-^^^+''_R^'^^'^"(T^'^^'^^^h 
where we set 

\(6,6) ife^Ag, 

passing part of the total gain from the double summation to Ag. In particular, (Lo-2(e)) holds. 

Next, consider the case Ag ^ {e,a{e)} and Ag^jg) >- (T(e). Assume first that Ag and Ao-(g) are not both 
bough leaves that are lonely in A^"^ ^"^'K Then we get, using again that xi,(g-j and Xft(cr(e)) £^re fixed by Ag, 
that 

^(g)(^(e)) ^ 2M-i+2'5M-i+2'5i?('"'(^»(T(^")) = 2M-i-^+'^M-i+''+3'5i?('''(""))(T(^)) 

< 2M-i-^+'^i?('"'(^»(T('"'('^»), 

where we set 

^(^2(g)) ^^ 1(6,6) if e e {Ag,A^(g)} is a leaf lonely in A('^'(^")) 

I r'*^^ (e) otherwise . 

Here we used that, in order for (Lg.2/g')) to hold, we need at most one of the bough edges Ag and A„tg\ to 

receive the tag (&, 6) in r^'^ '^'^''. 

Finally, we consider the case where both Ag =: el ^ {e, cr(e)} and A„(^g) =: e" >- a{e) are bough leaves 

that are lonely in A^*^ ^'^^\ Although our goal is to sum out only the edges e and a{e), it will prove necessary 
to first sum out all four edges e, (7(e), e', e" in order to get a sufRciently strong reduction of the entropy 
factor. Having done this, we put back the sum over the end-labels of e' and e" (thus undoing their "striking" 
out of the graph that resulted from summing them out) to get the needed factor i?*^*^ ('^^^(t'^'^^). 

Thus, we sum over all the labels of the four vertices 6(e), 6(CT(e)), 6(e'), and 6(e") in the expression for 
/j(e)(T-(e)^. -^^g fjx all other labels. Now it is easy to see that the label Xf,(e') uniquely determines the other 
three labels, so the total entropy factor for these summations is M . Hence summing over the above four 
labels in the expression for R'^'^^ {t'^'^^) yields the bound 

^(g)(^(g)) j^ {2M^^)^]Vr^R, 
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where R is the expression obtained from R^'^'{t^'^') by summing out the edges e, cr(e), e', e" . (In the estimate 
we used the worst case scenario, in which the edges e, e', e" are of type (6, 5) and the edge (j{e) of type (6, 0).) 
Next, it is easy to see that summing out the two edges e' and e" in the expression for i?*^"" (<=))(r'^'^)) gives 
the equahty 

i?(-^(5))(^(e)) ^ {2M^&YR 

since at the moment when e is summed out, both e' and e" are nonlonely bough leaves, thus T^'^'{e') = 



(s)(e'M = 



(6, 5). Thus we find 



R'^'^^t^"^) < 2M-3+2'5i?('^'(e))(^(s)) 



2M 



-1-2/^-4(5 /' 71 y—l+/i+3<5 






where we set 



-('^"(s)) 



(e) 



(6,6) ifee{Aj,^,(g)} 
r'^'^)(e) otherwise. 



The factor (M-i+^+-''*)^ is absorbed into i?('^)(T('^)) to get R'^^^t^'^^ (''''>'>) and at the same time ensure that 
(Lg.2(g-,) is satisfied. This covers the third hue of (10.8), and hence concludes the proof. D 

10.4. Sum over degenerate bough labels. In this subsection we sum over all labels associated with degenerate 
bough edges, having already summed out the nondegenerate boughs in the previous section. We estimate 
/^(ed)('.^(ed)^^ where Cd is the first degenerate bough edge of £b, and t'^'^'^' is a tagging satisfying (Lg^). Our 
strategy is very similar to Subsection 9.3. For e € B^''''^ (i.e. Cd ^ e ^ Cg) we define the inverse A~^ by 
restricting Definition 9.5 to B^'^''\ In other words, we define A^^ :~ e' if there exists a (necessarily unique) 
e' satisfying e^ ^ e' ^ e and Ae' = e. Otherwise we set A~^ :— e. 

By the assumption (Lg^), every leaf e G B^®-*) that is lonely in A'-'^''^ has tag T^'^''^(e) = (6,6). Therefore 
we may reproduce the proof of (9.22) verbatim to get, for a fixed X5, 



i?(e.)(^(e.)) ^ /(A) nEn|^-(e)(^-.( = )-.(.)'^-Me,..(. 

7er ^£7 



where 



m := n 

eeSt^d) loaf 



2M 



2(5 



M 



l(e ^ Ae) 2M-^+^+5'' + 1{A-^ ^e = Ae) 2M' 



2S 



(10.11) 



(10.12) 



Note that, unlike in (9.22), the product in (10.12) ranges only over leaves, since degenerate boughs consist 
only of leaves. 

We may now put the estimate on both nondegenerate and degenerate boughs together. From (10.5), 
Proposition 10.5, and (10.11) we get 



E 



[] l(ex(e) = g^iAe)) 



.eeSE 



11 11 "-r(e)(.-"a:a(e)2:6(e)'-"a:He)a;„(e); 

J 7er(r,A) ^^7 



< F{A) nEnr-(-)(^-»( = )-Me,'^-M.)-.<e,) ' (10.13) 
7er '=6T 
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where 



F{A) 



eG^B nonlcaf 



n 



2M-''- 



eG^B bound leaf 



X n 

e^Su free leaf 



W (l(Ae = e) + 2M-i+2'5 + l(Ae )- e is a leaf)2Af-^-'^ 

1{A, e {e,a(e)})l(A„(e) =a(e))2Af-i+^+5^ + l(Ae G {e, a(e)}) 2M-1+4*' + 2M-i-^+^ 
X JJ [2M-i+2'^ + l(e = Ae)2Af-i+^+^'' + l(yl-i -<e = Ae)2MH . (10.14) 

e^Ss degenerate leaf 



As was advertised before the proof of Proposition 10.5, this estimate is designed to counterbalance the various 
smallness factors and the entropy factors for the lumping summation. For instance, in the second line, the 
prefactor is M~^+'^*~^)^+'^'^'') where « = 0,1,2 is the number of hard constraints, so after summation over 
the lumpings, each summand wih be of the same order M~^^^^^^ . The same balance can be seen among 
the first two summands in the last line, while the last summand will be treated similarly to how the second 
factor in (9.23) was evaluated in Subsection 9.4. Finally, in the product over the nonleaves in the first line, 
only a weaker bound is available if Ae ;^ e is a leaf. But this bound is strong enough to guarantee that, 
even after summation over A, the total contribution of the nonleaves is C^ instead of C^ , where L is the 
number of leaves (see (10.18)). Since some (small, at worst 0{M~^)) gain is available for each leaf, a factor 
C^ is affordable. 

10.5. General laggings and sum over bough lumpings. So far we assumed that all nonleaf bough edges had 
tag (6,0) and all bough leaves tag (6, 1). As in Subsection 9.3, we split the tagging into a bough and stem 
tagging, T ^ {tb,ts)., and define 



F{A,tb) ■■= F{A) H (M 



25-lNl(rB(e)#(6,0)) 



(10.15) 



eG^B nonleaf 



Then (10.13) for arbitrary t holds if F{A) on the right-hand side is replaced by F{A, tb)- 

Now we sum over all bough lumpings A e ^(G U G"). We start by summing over A^ in (10.14) for all 
degenerate edges e. Now we proceed as in Subsection 9.4. In fact, we perform the summation of Subsection 
9.4 over A only on the second factor of (9.23), as the (trivial) nonleaf edges are treated separately. Moreover, 
the additional summand in each nonleaf factor l(e = Ag) 2M~^+^+^'' is trivially accounted for. Thus we get 



E n 

A'^'^d) &^£-B degenerate leaf 



2M" 



-1+25 



l(e = Ae) 2M-^+^'+'^^ + l{A-^ <e = A^) 2M 



2(5 



L'^' /2 

sc iCM-^+''+^^] , (10.16) 



where L^"*^ = L^'^'>{G U G') is the number of degenerate leaves in G U G'. From (10.1), (10.13) with an 
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arbitrary tagging tb, and (10.15) we therefore get 



E, 



VUG' 



< 



e££B nonlcaf 



n 



2M- 



eGf B bound leaf 



c^Sb free leaf 



CM 



-l + n+7S 



Lf'^'/a 



X E E Q(^s)n'^nr-(<=)(^-»(=)-Me,'^-M.,-.<e,) - (10.17) 

rea'„,„/xs:r(xs)=r 7er ^£7 

where L is the total number of bough leaves in G U G' . Recalling that the number of edges of G U G" is 
bounded by Af, we find 



Yl f 1 + 2Af-i+^+2'*~ + 2iAf-^-M ^ C^ 

e^Es nonleaf 



(10.18) 



Recall that L'^''' denotes the number of degenerate leaves of GU G'. Similarly, denote by L'^^^ the number 
of bound leaves of G U G' and by L^^") the number of free leaves of G U G'. We have proved the following 
result. 

Proposition 10.6. For any G, Q' e 0j we have 



Egug> s^ 



n (^'^ 



-l+25\nr(e)^{bm 



_eGfs nonlcaf 



i(i>) 



(/) 



{GM-'y {GM-'+f^+'Y {GM-^+^+^') 



r-i+tj.+7S\L'- 72 



>< E E Q(^s)n^nr-(-)(^-»w-Ke)'^-M=)-.(e,) • (10-19) 

?£«'„,„' xs:r(xs)=r 7er '^^^ 



10.6. Decoupling and sum over the tagging. We now proceed as in Subsection 9.5 and prove the following 
result which is analogous to Lemma 9.8. In order to state it, we split 

i(*)(GuG') = i(*)(G) + L(')(G'); 

here L'^^\G) is the number of bough leaves of G of type i, where i can be 6 (for "bound"), / (for "free"), or 
d (for "degenerate"). 

Proposition 10.7. We have 

n — 1 n — 1 

E,^Y. Mt)aAt)\ E {M-'^'T' E E ^«X 



J2 [l(|f (5(G))| == M)l(dcg(5(G),Ts) =u + 2r)l{2\£iB{G))\ ^n-u-2r) 



primed 

-l+2S\Mr{e)^ibfi)) 



X n i^^' 

eGf snonleaf 



(10.20) 
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Proof. See Appendix D. 

We may now sum over the bough tagging tb in (10.20) to get 

n— 1 n' — 1 



n 



X 



r,r'>0 



u=0 'u'=0 



E E[l(l^('5(G))l=")l(dcg('5(G)'^s)=" + 20l(2|^(^(G))| 



= n — M — 2r ) 



G,G'G2» Ts 



primed 



eGf Biionlcaf 

n— 1 n — 1 

< E i«nW-n' wi E (M-^+^O'-^"' E E V.' 



■u=0 «'=0 



E E[l(l^('5(G))l-")l(d'^g('5(G)'^s)=" + 20l(2|^(^(G))| 



= n — M — 2r ) 



G,G'G2B TS 



primed 



X (7((7^/-^)^*''(G) + L('')(G')/^^^_l+^+55Ni<^'(G)+L"'(G')/^^_l+^+75Ni<'''(G)/2+L<''>(G')/2 

where the second incquahty follows analogously to (9.34). Next, we sum over the stem tagging rg using 
(9.35), 



^1 ^ E |a"(i)an'(t)| E i^' 



n — 1 n — 1 



-l+li+2SY+r 



/ . / . "-«."' 



E [l(|f (5(G))| = u)l{2\£{B(G))\ =n-u-2r) 



G,G'ew 



primed 



X (^((7^-5)^"''(G)+i<"(G')/-^^-l+p+55Ni<^'(G) + L(^'(G')/^^^_l+^+75Ni<'"(G)/2+L(''>(G')/2 
n — 1 ri' — 1 

s% E \an{t)an'{t)\ E E ^"."' 

^(^-1+^+2.)'- ^ i(|£(5(G))|=«)l(2|£:(^(G))|=n-«-2r) 



Gean 



C{CM 



^s\l'-''Hg) 



(CM 



-l+^,+5S\L'"(G) 



y ^^^(CM-1+^+7^) 



+^+7S\L^''Hg)/2 



primed 



(10.21) 



10.7. Sum over the bough graphs. Now we may sum over the graphs G,G' E W in (10.21). A key ingredient 
is the following combinatorial estimate. Let Skfb be the number of nondcgenerate boughs with k edges, / 
free leaves, and b bound leaves. In other words, Skft is the number of oriented, unlabelled, rooted trees with 
k edges and f + b leaves, such that the number of groups of adjacent leaves, excluding leaves incident to the 
root, is equal to / (see Definition 10.2). 
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Lemma 10.8. We have 

Proof. We construct an arbitrary nondegeneratc bough corresponding to the triple (fc, /, b) in two steps. 

(i) We choose an oriented, unlabelled, rooted tree T with k — b edges and / leaves such that no two leaves 
are adjacent and no leaf is incident to the root. 

(ii) We add b leaves to T by requiring that each newly added leaf be either adjacent to an existing leaf or 
incident to the root. 

Clearly, the number of possible choices for the tree T in (i) is bounded by the number of oriented, unlabelled, 
rooted trees with k edges and / leaves. This was estimated in (9.39) by k"^^^^. 

Next, let y denote the subset of vertices of T consisting of the root of T and of all initial vertices of the 
leaves of T. Step (ii) means that we have to add b leaves to T under the constraint that each newly added 
leaf be incident to a vertex of Y. The new leaves may be thought of as being added to a certain number, 
z, of allowed slots; each slot may receive several new leaves. The number of slots associated with a vertex 
V E i^ is computed as follows. For a vertex v of T, let c^, G N denote the number of children of v in T. It 
is easy to see that the number of slots associated with the vertex w S "^ is equal to Cy if v is not the root 
and to Ct, + 1 ii v is the root. In this counting we have taken the orientation of the graph into account, 
i.e. existing edges are drawn in the plane, and the new edges emanate between them. Thus the number of 
slots associated with a leaf e is equal to the number of planar "wedges" delimited by edges incident to a(e), 
whereby the two wedges on either side of e count as one; see Figure 10.2. 



Figure 10.2; A bough with A: = 19 edges and f — 7 free leaves. The z = 14 slots for adding bound leaves are indicated 
in grey. The stem is indicated with a dotted line. 

Thus, the total number of slots is z = 1 + YlveT '^^- Now let us denote by y the subset of vertices of 
V(r) that are not leaf vertices (or in other words the root together with the vertices that have degree greater 
than one). It is easy to see that we have 

1+ ^(c.-l) = /. 

(This relation holds for any rooted tree with / leaves.) Therefore, using \'y\ = / + 1 and 'Y C 1^', we get 

z = / + 2+;^(c„-l) sc: 2/ + 1. 
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Therefore the number of ways to add b leaves to T according to (ii) is bounded by 

z-1 J ^ 
The claim follows. D 

We now proceed similarly to Subsection 9.7 in order to estimate the sum over G,G' E W in (10.21). Let 
us first concentrate on G. The stem of G has u edges. Let s ^ denote the number of nondegenerate boughs 
of G and q ^ the number of degenerate boughs of G. The nondegenerate boughs consist of altogether 
k ^ edges and the degenerate boughs oi m^ edges. 

We index the nondegenerate boughs in some arbitrary fashion using i = 1, . . . , s, and denote by fc^ ^ 1 
the number of edges in the i-th nondegenerate bough; we have ki + ■ ■ ■ + ks = k. Similarly, we index the 
degenerate boughs using i ^ 1, . . . ,q, and denote by rn^ ^ 1 the number of edges in the i-th degenerate bough 
(which is equal to the number of degenerate leaves in the i-th degenerate bough); we have mi + - ■ ■+mq = m. 

We use /i ^ 1 to count the number of free leaves and bi ^ the number of bound leaves in the i-th 
nondegenerate bough. Thus, we have the relations 

^6, -lW(G), ^,A = L(/)(G), ^to, =m = iW(G). 

i i i 

Putting all of this together, we may bound the sum over C G 211 in (10.21) as 

^(...) <c ^('"+^V" + ^Wl(2(fc + m)=n-«-2r)l(r + .s + Ol) 

GeaiT s,q ^ * ^ ^ '^ ^ k,m 

X E E E---EE---E^^.A^.---^^./.^.(---). 

fciH hfcs=fc nilH i-m,=m j\=l f^ = lbi=0 63=0 

Here the sum ^j, , , j, ^j, is understood to mean l(fc = 0) if s = (see (10.23) below); similarly for the 

sum ^j^ J , ^ ^^j. The binomial factors arise from the choice of the root vertices of the boughs (each 

bough root may be one of the u + 1 stem vertices). The constraint r + s + q ^ 1 is an immediate consequence 
of the constraint that if r = on the right-hand side of (10.21) then the second indicator function implies 
2{k + m) = n — u^ 1, and that fc > 1 implies s > 1, while m > 1 implies q > 1. This is simply a restatement 
of the fact that we must have either a bough (s ^ 1 or g ^ 1) or a small edge in the stem (r ^ 1), in 
accordance with the original restriction Q € &* . 

We also introduce the analogous primed quantities associated with G'. Thus we get from (10.21) 

n — 1 n' — 1 

Si < G E |a"(i)«n'(t)lEE^«."'^«^«^"'.«'' (10-22) 

where we defined 

Zn,u ■■= E r^ jr^ jl(2(fc + m)=n-u-2r)l(r + .s + g>l) E E 

r,s,q,k,m'^0 fciH \-ks—k mi-\ \-mq—m 

ki ks ki ks 

X E " ' E E " ' E ^''ifibi ■ ■ ■ Skj^b, 

/l = l /a=lfcl=0 fce=0 

X (M-i+^+2^)"(GM-^)''^+-+'= (GM-i+^+5^)^'^+-+^= (GM-1+^+7^)"/' . 
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Now Lemma 10.8 yields 

where we used that ki ^ ikf and ii < | — |(5. This is one stage where the restriction /i < | is crucial. 
Therefore, 

Zn,u ^ Y^ (M^)"+«1(2(A: + to) =n-M-2r)l(r + s + g^ 1) 

r.s,q,k.m'^0 

where we used that m + 1 ^ Af. 
Next, we estimate 

Y^ 1 = l(s=.0)l(fc = 0) + l(s7^0)l(ssCfc)r2|) sC (M^)[''-il+/(s,/c), (10.23) 

ki+---+ks=k ^ ^ 

where we defined 

/(s, fc) := l(s = 0)1(A: = 0) + l(s 7^ 0)l(s s^ fc) . 

Similarly, wc have 

m — 1 

q 



J2 1 - l(g-0)l(m = 0) + l(g^O)l(gsCm)f _ J ^ 2™/(g 



?TiiH \-rng—m 

Using that q ^ m and consequently 

we therefore get the following result. 
Proposition 10.9. We have the bound 

n-ln'-l 
El ^ C ^ \an{t)an'it)\ y^ y^ hu,u' Zn^uZn'.u' , 
n+n'^Mi^ u=Ou'=0 

where 

Zn,u ^ E (^"^)^^'^^^l(2(fc + m) =n-u-2r)l(r + .s + 9 ^ l)/(.s,fc)/(g,m) 

r.s,q,k,m'^0 

X (M-i+^+2*)''(CM-i+3^+5^)'(CM-i+3^+7*)'"/'. (10.24) 

Proposition 10.9 is the main result of this subsection. Note that the restriction H < n will be crucial in 
performing the summations in (10.24) over both s and m; the summation over r is less critical. This is an 
indication that both the number of boughs and their combinatorial complexity arc critically compensated by 



65 



the smallness of the lonely leaves. The geometric series in s, to, r are the key ingredients of the complicated 
estimate (10.24). The other two summation variables, k and q, are controlled by these variables, so the sum 
is finite. To ensure that it is actually small, the rather baroque collection of indicator functions is necessary. 
They make sure that at least one negative M-power is gained from one of the factors, as we shall see in the 
next subsection. 

10.8. Conclusion of the estimate. What remains is an elementary and only moderately enlightening estimate 
oi El using (10.24). 

Proposition 10.10. We have 

Zn.u ^ (CM-i+3^+^*y"""^/' + o(l)M-^ . 

Proof. In (10.24) we bound the indicator function 

l{r + s + q ^ l) s^ l{r^s^ 0)l{q ^ 1) + l(r > 1) + l(.s > 1) , 

which yields the bound Zn.u ^ ^n u + -^n « + ■^"'m ^^ self-explanatory notation. 

If r + s = in (10.24) then /(s, k) = 1 implies ,s = and hence fc = 0, so that we get 

<^„ sC Y. l(2m = n-^i)l(gsCTO)(CM-i+3^+^*)"/' < (c'M-i+3^+^^)<"""^/' . 

Next, we get 

Zn,u ^ Yl (M"^)^^'=*^^l(?-> l)l(2(fc + m) =n-u-2r)/(s,fc)/(g,TO) 

r.s,q,k.m'^0 

< ^(M-i+^+2^)'" ^(CM-i+3'^+5*)' J2 m(CM-i+3^+7^)"/' J2 1(2(^ + m)=n-u-2r) 

< CM-1+^+2*. 
Similarly we get 

ZZu ^ M-^^(M-i+^+2^)'"^(CM-i+3^+5*)* J2 ^(CM-i+3^+^^)"/'^l(2(fc + TO) =n-w-2r) 

< o(l)M-^. 
This concludes the proof. D 

From (10.22) and using Proposition 10.10 we find 

n— 1 n' — 1 

El ^ C J2 |an(i)a«'(i)|^ ^/i«,«' 

(CM-i+-^^+^^) '"""^/^ + o(l)M-H [(CM-i+3^+7^) ^"'""'^/' + o(l)M-H . (10.25) 
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Setting V = n — u and v — n' — u' yields 

^1 ^ C ^ l(w + «'s$M'')[(CM-i+3''+7*)''/%o(l)M-H[(CM-i+3M+^^)"'/Vo(l)Af-^" 

X ^ ^ l(n + n' s$ M^) |a„(i)a„,(t)|;i„_,,„,_„. . (10.26) 

The second line of (10.26) is bounded by 



n+n'^Mf^ 



|an(0"n'(*)l^n-«,n'-t>' + X! I"" (^)""-y+i'' (*) l^"-" ."-« ^ ^' 



n<Mt^ 



by (8.25), (8.22), and (4.5). Therefore (10.26) yields 

El s$ o(l). 

10.9. Bound on £2- Finally, we outline how to bound E2; the argument is almost identical to Subsection 9.8. 
The preceding analysis carries over trivially to E2, the only modification being that Q' = Z„' and u' — n' , 
i.e. we only have boughs in Q. The analogue of (10.25) yields 

n-l 

E2 ^ C Y. \an{t)a„,{t)\Y,hu^n'[{CM-'+^'^+'y'-'' + o{l)M-^ 

Now we proceed exactly as in Subsection 9.8 and get E2 = o{l). Hence the proof of Proposition 10.1 is 
complete. 



11. Proof of Theorem 3.4 

The main ingredient in the proof of Theorem 3.4 is the following estimate. 

Proposition 11.1. Let H be as in Theorem 3.4 and H the matrix whose entries are truncated as in (5.4). 
Let K < 1/3. Then there is a constant C^, depending on k, such that 

ETTUn{H/2) s^ C^N 

for all n s^ Af^. Lf n is odd then ETr Un{H / 2) = 0. 

Proof. The proof is a relatively straightforward consequence of the proof of Theorem 3.1. The claim about 
odd n is immediate since C/„ is odd for odd n. Using Proposition 6.7 we write 

ETrC/„(iJ/2) ^ J2Y1 E2J,,(g). 
X ge&„ 

The right-hand side is represented graphically, as in Section 6, by a single stem whose ends arc joined so as 
to produce a closed loop, to which are attached a family of boughs. Now the estimates of Sections 7 - 10 
carry over and yield the claim. This is a consequence of the following observations. 
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(i) Assume first that {cTxy} defines a band matrix, as in Section 2. The value associated with a graph Q 
and lumping F of the edges of Q is equal to J^x ^xiGi T), where V^' is given by Vx (see (7.6)) with one 
additional indicator function that constrains all stem vertices of Q, with the exception of its root, to 
be nonbacktracking. In the graph on the right-hand side of Figure 8.1 this may be viewed as making 
the vertex n black. 

It is now straightforward that all estimates from Sections 7-10 carry over; in fact, the additional 
indicator function in V^ results in somewhat smaller bounds. 

(ii) In order to extend the claim to the more general random matrices as defined in the statement of 
Theorem 3.4, we observe that the £^-£°°-type estimates that form the backbone of Sections 8-10 
remain unchanged. The spatial structure of the band defined by a shape function / was used in two 
places: first, in the analysis of the ladder diagrams; second, in the ensuing heat kernel bounds on the 
right-hand side of (8.23). As we are only interested in the trace (which corresponds to summing over 
all vertex labels), we do not need the precise spatial information associated with the ladders, merely 
a bound on the i?^-norm of their contribution (in fact, it is a simple matter to check that under the 
additional nonbacktracking condition the ladder pairings do not even appear). Moreover, dropping the 
detailed heat kernel bounds in (8.23) yields the bound 

n+n' —2p 

instead of (8.24). See the remarks after (8.23). D 

We may now complete the proof of Theorem 3.4. We need the following elementary results on Chebyshev 
polynomials. 

Lemma 11.2. Let n be even. 

(i) For^eR we have L/„(0 > -{n + 1). 

(ii) Un{l + C) is increasing for ^ ^ 0. 

(iii) For (, e [0, 1] we have Un{l + > e"^. 

Proof. If ^ e [—1, 1] the claim (i) is easily seen from cither (4.2) or the recursion relation (4.3). For £, ^ 1, 
the claim (i) follows immediately from the formula 

Tx . , >N sinh(n -I- 1)C .-,, ,x 

C/„(coshC) = \/^ (11-1) 

smhf 

itself a straightforward consequence of (4.2) and analyticity 
The claim (ii) follows from (11.1). 
In order to prove the claim (iii), pick ( ^ such that 1 + ^ = cosh^. Using (11.1) we get for ^ G [0, 1] 

smhC 
Denote by A^ax the largest eigenvalue of H. Then we get for ^ e [0, 1], using Lemma 11.2 (ii) and (iii), 

P(W>2 + 2e) ^ p(c/„(A„a./2)^t/„(l + 0) ^ ElA^^I^, 
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Thus Lemma 11.2 (i) and Proposition 11.1 yield 

F^i ~>o^o,\ < ETrL/„(g/2) + JV(n+l) JV(C. + n + 1) 
F(A„ax>2 + 2Cj s^ -^^^ < -^^^ , 

for all n < M**. Setting ^ =■ M^^/^+^/2 and invoking the bound (5.5) gives 

A„.axSJ2 + ^^^5^^j ^ P(^A.„a,>2+^^^57^j+CiVe < ___^_^__^ + CiV e 

(11.2) 
Choosing k satisfying 1/3 — k = e/3 and (5 = e/37 (see (5.3)) completes the proof. 



A. Proof of Proposition 5.1 

A.l. Control of the spread of time evolution. Abbreviate V't •= e~'*-^/^(5o. We start by estimating 
(i/'t , IxpV't)- Using \dti)t = i?V't/2 we find 



This gives 



2 

x,y 






2 



l-ffa;yl|a;-2/ 
for any e > 0. Here we defined 



< E ly^2e Nl \M^)\ iy)''\My)\ , (a.i) 



Next, we recall Schur's inequality, valid for any matrix A, 

\\A\\ ^ LupJ2\A,y\\ (supJ2\A,y\] . (A.2) 

\^y / \ y J- / 

Thus we get from (A.l), for any C > 0, 

\a / I \ \2 I \\ ^ f V- l^:rjy||a;-y|y^Y >r- |i?a;y||a; - ylV^^ I ,2 , xl/2/ , / ,4e , \l/2 

\dt{iptAx\w\ ^ (sup2^ ^— ^^ 1 (sup2^ '-^^ 1 (V't,|x| Vt) {iptAx) ipt) 
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where we defined 

In order to estimate B we observe tliat tlie inequality {x + y) ^ 2{x){y) implies 






l+2e 



V 

Thus we get 

B s^ 2^supy '^"^'^f~/^'^'' sc: 2-supy^^#^^^%^|^ ^ 22-supy^^%La,,(x-v)i+3^ 

(A.3) 
Next, for u ^ 1 define 

n. := (sup^^^..: 

In order to find a bound on P(ri^), we note that, by the uniform subcxponcntial decay of the entries of A, 
we have 



\{xY{vY J 



Therefore 

Moreover, from (A.3) we get on ilu 

B sC 22^usup^cr^j,(a;-y)^+^^ 

< 2-.sup^^;j-^(--y>^+^/^+^V., 

< cMsn^Y.^x^yr+'+^^al/" 

provided that 8e ^ -q. Here we used (2.4) and the assumption (2.2). 
Summarizing: On fi„ we have 



Choosing C"^ = y^y/d/2+l+Ae+d yj(,ijg 



Y- 



1 



\dt{4>u\x\''i>t)\ ^ C(^^(V'*,|x|2V;,)+w2l^2d+2+8s^^^^^^^4e^^^ 
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Let us take e < 1/4. Then we have, for any ^ > 0, 
Choosing ^^^ ~ ^2y^M+2+»e therefore yields 

Thus Gronwall's lemma, together with (■00 , l^^pi/'o) — Oj implies that on r2„ we have 
Therefore we have showed that, for all t ^ W^, we have 

on Q,u- 

A. 2. Conclusion of the proof. Let us abbreviate ipt = e~^*^^^So and ^j — e^^^^^^So- Then we have 

Thus, using ||'0t|| ~ 1, we get 



(A.5) 



Next, we observe that 



\^xy -^xy\ 



= \-l(\x\^N)l{\y\^N)\\H^y\ 

^ l{\y\^Nl2)\H^y\ + l{\x-y\^N/2)\H,y\ 



(A.6) 



This gives 



x,y,z 

< Y, l(lyl > N/2)l{\z\ > N /2)\H^y\\H^,\\^t{y)\\Mz)\ 

x,y,z 

+ 2 ^ l(|y| ^ N/2)l{\x -z\^ N /2)\H^y\\H^,\\Mv)\\Mz)\ 

x,y,z 

+ J2 l(k - y| > ^^/2)l(k - z| > 7V/2)|i/,^||i/,,||V't(y)||^t(z)| . (A.7) 

We estimate the second term of (A.7); the two other terms are dealt with in exactly the same way. On ft^ 
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the second term of (A. 7) is bounded by 

2u^ Y, {xf'{yr{zr<J.yl{\x - z| > N/2)a,, l{\y\ > N /2)\My)\\Mz)\ 

x,y,z 

< Cu^ Y,{x- vYa^y {x - zri{\x -z\^ N/2)a,, l(|y| > N/2){yf'\My)\ {z)''\Mz)\ 

x,y,z 

^ CuHl{\x\ > N/2){x)^'M{x)^'^t\\ 



X I svi]^y^{x - yf axy {x - zYl{\x - z\ > N /2)axz 

1/2 



s.wpY{x - yY a^y {x - zY\{\x - z\ ^ N/2)a^ 

< Cu'\\l{\x\ > N/2){xY'^t\\\\{xY'i^t\\ 

■^^Y>Y{x-yY(Txy\ {snY>Y{x - yYl{\x - y\ > N /2)a^y\ , 



where we used Schur's inequahty (A. 2). Next, we observe that (2.4) and (2.2) yield 

snvY.{x-yYcTxy ^ snj>{y^{x-y)-''-A''{y^{x-yY+^^-aly\' ^ C^W'I^^^^ 
^ V ^ y y 



as well as 



sup^(a; - yYl{\x - y\ ^ N/2)a,y 

X 

y 

=^ ( E \^\d+2+2e ) [Y)y 

<: Z( Y ^ V V'^/2+ 



d+2+4e 2 



l + 2£ 



Ny|>7V/2 '^' 

< ^^<i/2+l+2e 
N 

Moreover, 

||l(|:c|^Ar/2)(a;)2-7A,|| < Ar2--i||(a;)V't|| < N-^'^\\{x)i;t\\ ■ 

Estimating the first and third terms of (A. 7) along the same lines, and putting everything together, yields 

\\{H-H)M' ^ ^W'+'\\{x)M'- 



Using (A. 5) we therefore get 



IV ivtw ^ ^ 
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Integrating (A. 6) we find the bound, valid on f2„, 

uniformly for t ^ W^ . Setting u = W and recalling (A. 4) yields the claim. 

B. Proof of Proposition 5.4 

We start by partitioning A^v = Ua ^n,a into cubes A^r^ of side length W. In order to simplify notation, we 
assume that N — 2LW for some integer L G N. The (2L)'' cubes are indexed by A e Al '■= {—L, . . . , L—l}'^. 
We set 

AAr,A := {WA + x -.xi^ {{),..., W -lY] 

Let Pa denote the projection {PA'ip){x) := l(x e KM.A)i'{x). 

Next, decompose H into its cube components Hab '■= PaHPb- Thus, Has is a W^ x W^ matrix. By 
Schur's inequality (A. 2), we have 

||i?|| sC sup V WHabW ■ 

^^■^^ BEA, 

Let g be a periodic function on Al to be chosen later, and set 

Qo -^ {\\HAB\\^SM^^g{A-B)ioTal\A,BeAL}. 

Thus, on Hq we have 

\\H\\ ^ iM^'Y.9{A). (B.l) 

A 

In order to derive an estimate on the probability of fio? "^6 use the Marcinkiewicz-Zygmund inequality: 
li Zi, . . . , Zn are independent mean-zero complex random variables and ai, . . . , a„ G C, then 



E 



p/2 

Y,a^Z, ^ (Cp)P/2E(^|a,Z,|2) . (B.2) 



(See e.g. [8], Exercise 2.2.30, for a proof that gives the constant (Cp)^/^.) Defining A^ :— X^i^iP' Jensen's 
inequality therefore yields for p ^ 2 



E 



^a,z/ < (Cpf/^APEl'^^lZ.py' sc (Cpf/2AP^M!E|Z,|P ^ (CA^pf/^ max E|Z,r . 

i i i 

(B.3) 

Next, we have, for x,y E {0, . . . ,W — 1}'^, 

(. ^2 . 2 1 f [AW + x-BW-y]N \ I .(., „i , p ^ 

(CTABJxy — <^WA+S,WB+y = JJ J { fTT - -JT J [[^ - B\2L + rC^y) 



iy •— "WA+xWB+y - ~^ J \ y^ / ^ M^ '-'\2L ^ J^^xy ) , 

where |i?2y| < 1. Thus (2.4) yields 

{oABfxy < ^J{[A-B]2l). (B.4) 
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We may now estimate E I {ipi , i?AB'02) I for any p ^ 2 and V'l ; "02 G C^ satisfying 1 1 i/^i 1 1 , 1 1 "02 1 1 ^ 1 • Let us 
first take A y^ B. Then we get 



{HAB)xy 



V- xy_ [crAB)xy->kl{x)^2{y) 

r-f, \PAB)xy 



P 



(B.5) 



E\{tl;i,HAB^2)f - E 

x,y y 

where we restrict the summation to x, y satisfying (cr AB^xy 7^ 0- Observing that 

{HABJxy — H\YA+x.,WB+y ■, 

we see that the random variables {Zxy)xye{o,....w-i}'^ ^^^ independent and satisfy \Z^y\ ^ M . Therefore 
(B.3) and (B.5) yield 

x,y 

where in the last step we used (B.4). If A = _B then the random variables Z^y arc no longer independent; 
this is easily remedied by splitting the summation over x, y in (B.5) into two parts: x ^y and x > y. Using 
the estimate \a + b\P < |2o|p + \2b\P we therefore get the bound 

p/2 



E|(7/^i,i?^BV'2)r < (CpM-i+2^/([A-S]2l)) , (B.6) 



vahd for all A,B. 

Next, we estimate, using (B.6), 



'{\{iPuHABtP2)\>M^'g(A 



E\{^,,HABi^2)\' ^ (Cpf{[A-BU) 



p/2 



{M^^g{A-B)Y " lMi+2V(^-S) 



{^i,HAB^2)\-^M^'g{A-B)] ^(^A . (B.7) 



Setting p — vM for some fixed v > Q and defining g{A) :~ y f{\M,2L) yields 

/ Cv 

Note that this choice of g implies 

Y,g{A) ^ J2 4^) ^ [Y. J{A){A)^+A ( ^ (A)-'^-M < C, 



(B., 



by (2.2). 

In order to estimate |l_ffyiiB|l, we define the rectangular lattice 

I := Ue —^-L"^' : ll^ll sCl 
1 2VF'^/2 "^" 
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It is easy to see that |/| < {mi'^l'^Y'\ Now set 



^AB := \ sup |(V'i,i?AB^2)|<M^^9(A-S) 

^■0i,'02e-f 



Therefore (B.7) yields 



.^^..M/2 ( CvM^I- y^l^ 

AB) ^ 1^1 l^^^j ^ 1^ M2^ j 

We now do an approximation argument using the lattice /. Let ipi,'4'2 satisfy HV"!!!; IIV'2ll ^ 1 ^^i^ 

\\Hab\\ = (^i*,i?AB^2)- 

Now by definition of /, there are ipi,ip2 € I such that ||V'i — fAiH, ||'(/'2 ^ "02 11 ^ 1/4- This gives 

\\Hab\\ = (V'l-V'l+V'l,i^As(V'2-V^2 + ^2)) =^ ||i7AB||('2j + ^')+|(V'l,i?ABV'2)|. 

Thus, on flAB we have 

^ Ifi 

Ili^Asll s^ yM2^.9(A-S). 

We have therefore proved that f^o ^ C\a beA ^ab, which yields the probability bound 

Choosing z/ large enough yields 

for large enough Af and some fixed £ > 0. 

Moreover, (B.l) and (B.8) imply that on Vt^ we have 

IliJil ^ CM^^ . 



C. Proof of Lemma 8.2 

We start with the following observation which allows us to rule out the simple case n-\-n' ^8. Assume that 
n + n' ^ 8 and that F € %i^n' \ Vn.n' ■ In order to prove (8.14), we have to construct a refining pairing 11 of 
F satisfying mijT} ^ 2. It may be easily checked that this is always possible. Throughout this appendix we 
therefore assume that 

n + n' > 8. (C.I) 

Choose some ordering of the edges £(/„ U /„'). Then lumps are ordered by their smallest edge. 
In a first step, we construct a special refining F' of F whose lumps are of size 2 or 4. Start by setting 
Fo := F and j = 0. 

• Denote by 7 the first lump in T j that satisfies I7I ^ 6; if there is no such lump, stop. 
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Denote by 7' the union of the first four edges of 7; define Fj+i :— Tj U {7', 7 \ 7'} \ 7. (That is, cut 
the lump 7 into two lumps of sizes 4 and I7I — 4.) 



• Set J I— > J + 1 and repeat this procedure. 
After the algorithm has terminated, set F' — Tj. We now claim that 

Mr') ^ IpiT). (C.2) 

Indeed, let Ui denote the number of lumps of size i in F. Thus we have 

p(F) = 2n4 + 4n6 + 6ns + 8nio + 10ni2 + • • • . 
From the definition of F' we get 

p(F') = 2^4 + 2ne + An^ + 4nio + 6ni2 + • • • , 

and (C.2) follows. 

In a second step, we construct a refining pairing 11 of F' using a greedy algorithm that generates a finite 
sequence of lumpings (F^) that are successive refinements of each other. Additionally, along this construction 
some bridges will get a mark. Bridges that received a mark at some stage retain it for all later stages. (To 
avoid confusion, we stress that this marking has nothing to do with the bridge tags; it is only used in this 
proof.) We shall construct the algorithm and the marking in such a way that, in the resulting pairing Ft, no 
two marked bridges belong to the same (anti)laddcr. Thus, the number of marked bridges will be a lower 
bound for m{Il). As usual we call lumps of size 2 bridges. We call lumps of size 4 four-lumps. We say 
that two bridges are compatible if they are neither parallel nor antiparallel; otherwise they are said to be 
incompatible. 

The following notions will prove helpful. We say that two edges ei and 62 are bridged in Tj if {ei, 62} G Tj. 
For a four-lump of the form 7 = {ei, 62, 63, 64} we introduce the operation of bridging e\ with 62 and 63 with 
64; this means that we set Fj+i := Tj U {{ei, 62}, {63, 64}} \ 7, i.e. we split the four-lump into two bridges. 

We now define the greedy algorithm and the marking. Start by setting Fq = F' and j = 0, and let all 
bridges of Fq be unmarked. 

Let 7 be the first four-lump of T j (recall that lumps have a fixed ordering). We define F^+i by refining 
7 into two bridges, and marking one of the bridges of Fj+i. We do this in such a way that 

(i) the newly marked bridge is compatible with all other bridges of Fj+i, and 

(ii) each newly created bridge is incompatible with at most one other bridge of F^+i. 

Now we show that such a refining process together with an appropriate marking is possible. First we 
deal with the case that there are two adjacent edges ei, 62 € 7. By the nonbacktracking constraint in (5a;(x), 
this is only possible if the common vertex of ei and 62 is either or n. Denote by 63, 64 the two other edges 
of 7. It is easy to see that there is an i = 1,2 and an i' = 3,4 such that the bridge {ei,ei'} is compatible 
with all bridges of Tj. We then define the lumping Tj^i by bridging e^ with e^' as well as the two remaining 
edges of 7 with each other. We mark the newly created bridge {6^,6^/}. That properties (i) and (ii) hold 
follows readily from the definition of (anti) parallel bridges. 

Let us therefore assume from now on that no two edges of 7 are adjacent. The lumping F^+i with marked 
bridges is defined according to the following four cases. (See Figure C.l for an illustration of each case.) 
In each case, both properties (i) and (ii) are easy to check. (Note that, under the additional assumption 
Hxx = for all x, it is easy to see that any two edges of 7 must be separated by at least two edges, so that 
only Case (cl) below needs to be considered.) 
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Figure C.l: The main step of the greedy algorithm. Top: (left to right) Case (a), Case (b), Case (cl). Bottom (left 
to right): Case (c2'), Case (c2"). For each case we draw a typical scenario, in which edges of 7 are separated by a 
single edge only if this is required by the case in question. The edges in 7 are drawn using thick black lines. Bridges 
already present in Fj are drawn using solid lines, and bridges added by the current step using dotted lines. In Case 
(a), the edges in 7' are drawn using thick grey lines. 



(a) There are two edges e, e' e 7 whose neighbouring 



all belong to another four-lump 7' € Tj . We 



choose an edge e" e 7 that has at least one neighbouring edge not in 7' (it is easy to see that, since 
Tj cannot consist of two interlacing four-lumps by (C.l), there always exists such an e"). We bridge e 
with e", as well as the two remaining edges of 7 with each other. We mark the newly created bridge 
{e,e"}. 



(b) There is a bridge {e,e'} G Tj such that every edge in 7 is adjacent to either e or e'. We bridge both 
edges adjacent to e with each other, as well as both edges adjacent to e' with each other. We mark the 
bridge {e, e'}. 

(c) Neither (a) nor (b) applies. We choose Cq G 7 so that the set of four edges adjacent to eo and its two 
neighbours contains at most one other edge in 7. (By (C.l) such an eo always exists.) Define 

C '■— {ei G 7 \ {eo} : neither neighbour of ei is bridged in Tj with a neighbour of eo} . 

(cl) If C 7^ 0, it is not hard to see that there is an ei G ^ such that the bridge 7\ {cq, ei} is incompatible 
with at most one bridge of Tj. We bridge eo with ei, and both remaining edges of 7 with each 
other. We mark the bridge {eo, ei}. 

(c2) If C = 0, there is a bridge {/o, /i} G Tj such that /o is adjacent to eo, and /i is adjacent to two 
edges, ei and e2; see Figure C.l. Wc choose e2 to be the edge "antipodal" to eo in the circular 



77 



ordering of the four edges of 7, i.e. 62 is the edge that cannot be reached from cq along the circle 
without crossing another edge of 7. Clearly, one of the two selected edges has this property. 
Define 63 := 7 \ {cq, ei, 62}. Let gi ^ /i and 52 7^ /i denote the two other neighbours of ei and 

62- 

(c2') Assume first that gi and 52 a-re not bridged in Tj. In this case we bridge ei with 62 and 
eg with 63; we mark the bridge {61,62}. It is immediate that {61,62} is compatible with all 
bridges in Tj, and that {eo, 63} is incompatible with precisely one bridge in Tj. 

(c2") Assume now that gi and 32 are bridged in Tj. Then we bridge 62 with 63 and 60 with ei. 
We mark the bridge {62,63}. Since Case (b) is excluded, we find that the bridge {62,63} is 
compatible with all bridges of Tj. Moreover, the bridge {eo, 61} is incompatible with precisely 
one bridge of Tj . 

The pictures in Figure C.l depict typical scenarios, in which edges of 7 are separated by a single edge 
(they are next-nearest neighbours) only if this is explicitly required in the case being considered. It is also 
possible that additional edges are next-nearest neighbours; e.g. it may happen that /o = gi in the last 
picture. Checking the few such explicit cases, one can see that the algorithm described above works for these 
cases as well, even though the pictures are not accurate. It is this step where the special choice of 60 made 
in Case (c) is necessary. 

Set j '-^ j + 1. If Tj is not yet a pairing, we repeat the procedure. Otherwise, we set 11 := Tj and stop the 
recursion; this is the completion of the algorithm. We need two crucial observations about the algorithm. 

First, no bridge of 11 is marked twice. Indeed, in Cases (a) and (c), the bridge marked at step j is new 
(i.e. does not exist in Tj); in Case (b) the bridge marked at step j, i.e. {e, e'}, was unmarked in F^, as follows 
from the definition of Case (a). (The marking of {e, e'} could only have been done in Case (a) if there e had 
been bridged with e', but this does not happen.) Therefore, the number of marked bridges of 11 is equal to 
the number of steps of the algorithm, i.e. the number of four-lumps in F', which is p(F')/2. 

Second, no two marked bridges of 11 belong to the same (anti)laddcr. Indeed, by construction, the bridge 
marked at step j of the algorithm is compatible with all bridges of Tj. Thus, if two marked bridges of 11, 7 
and 7', belong to the same (anti)ladder in 11, then there must exist a j such that at step j we added a bridge 
7" (marked or not) that was (anti)parallel to two bridges of F^, one belonging to an (anti)ladder containing 
7 and the other to an (anti)ladder containing 7'. By construction, however, this never happens; see (ii). 

In conclusion: II has p(F')/2 marked bridges, such that no two of them lie in the same ladder or antiladder 
of n. Therefore, for any choice of tags of the bridges of II, the resulting skeleton will always contain at least 
p(F')/2 bridges. From (C.2) we therefore get m{U) > p(F)/4. 

That m{lT) > 2 is easy to see from the fact that m{TT) — 1 would imply that II is either a complete ladder 
or a complete antiladder; this never happens by the property (i) of the greedy algorithm. 



D. Proof of Proposition 10.7 

The key to the proof Proposition 10.7 is a decoupling of the bough tagging from the bough graph. The is 
done by adding an appropriate number of bough edges to G U G", as in the proof of Lemma 9.8. 

Lemma D.l. There is an injective map y : ©j — > ©o .such that for any Q = (G, tg) and Q = (G, rg) = Y{Q) 
the following properties hold. 

(i) The tagged stems of Q and Q are identical. 
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(ti) dcg{B{G),TG)^2\£{B{G))\. 
(in) For any ^,^'€0(1 "^^ have the bound 



E, 



guG' 



< 



n ^M 

.eG^Biionleaf 



-l+25\Hr(e)^{bfi)) 



CM 



-S\L 






(CM- 



{CM 



-i+M+TSNi'^Va 



re 



xs:r(xs)=r 7er ^£7 



(D.l) 



where all quantities on the right-hand side of (D.l) are defined in terms ofQUG', i-e. £*-*' = L^^'{G^G') 
for i^bj, d, and t = tq^q, . 

Using Lemma D.l we find that Proposition 10.7 follows easily by repeating to the letter the argument at 
the beginning of Subsection 9.5. 

Proof of Lemma D.l. For any graph G we define the two following cases. 

(a) B{G) is either empty or contains at least one nondegenerate bough. 

(b) B{G) consists exclusively of degenerate boughs. 

Consider first the case that both G and G" satisfy (a) . Then we may proceed exactly as in the proof of 
Lemma 9.8. Thus, we define 

D := deg{B{G),TG)-2\£{B{G))\. 

If I? = set Q = Q. Otherwise B{G) contains a nondegenerate bough. Let e be the nonleaf bough edge that 
is reached first on the walk around G (see the proof of Proposition 6.6 for the definition of the walk around 
G). Define tj as ^ in which we replaced the edge e with a path of -D + 1 edges; here the first edge of the 
path carries the tag rcie) and all other edges of the path the tag (&, 0). 
Now set Y{Q) := Q. By construction, we have that 

l'^''Hg) = l'^^'Hg), L^f\G) ^ l'-^'Hg), l'-'^Hg) = L^'^\G). 

Moreover, Q and Q have the same number of small nonleaf bough edges. It is also easy to see that Claims 
(i) and (ii) hold. Moreover, as in the proof of Lemma 9.8, we find that the map CJ 1— > (J is injective. Defining 
Q' in the same way, we find that Claim (iii) follows from Proposition 10.6. 

Next, consider the case where G satisfies (b) and G" satisfies (a). The complication here is that we 
cannot add bough edges to G without changing the numbers L^^\ L^^\ L^'^\ If D = then we can set Q ^ Q 
and proceed as above. If I? > then there must be a (degenerate) bough edge e € £{B{G)) whose tag is 
TG(e) = (6, i) for i — 2,3,4. We now use the additional small factor arising from such an edge. We claim 
that in this case we can improve the bound (10.19) to 



Egug' s% 



TT /^^-l+25\l(^(e)7^(ft,0)) 



.eG^s nonleaf 



(D.2) 



rG??„„/xs:r(xs)=r 7er ^€7 
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Note the additional factor M^^+^'' at the expense of reducing the exponent of M^^+^+^'' by 1/2. We outline 
the proof of (D.2), which is almost identical to the proof of (10.19). In choosing the ordering of edges :<, 
we require that e be the first degenerate bough edge. When tackling the edge e immediately after the 
recursive algorithm (used for nondegenerate boughs) of Proposition 10.5 has terminated, we get a bound 
^ = M-'^+^^ = Af-i+p+5<57y//-p-3<5_ jj(,j.j, ^^j; gj.g^ ^gj.j^ jg ^jjg worst-case estimate using (10.2), and the 
second arises from the fact that, thanks to the assumption on T(e), the estimate (10.3) is now in fact valid if 
we multiply the right-hand side by a factor M"^^'^''. The remaining L^'^^ — 1 degenerate edges are estimated 
exactly as in Section 10.4. Thus we get (D.2). 

Now we may proceed as above. Let e be the (degenerate) leaf that is reached first on the walk around 
G. Define t/ as tj in which we replaced the edge e with a path of Z? -I- 1 edges; here the first edge of the path 
carries the tag Ta{e) and all other edges of the path carry the tag (fe, 0). Denoting by ^ ^ 1 the number of 
leaves in G belonging to the bough containing e, we have 



-(b) I 



-,(/) 



(d), 



- T(d)( 



LW(G) =1-1, L^r>{G) = 1, L^'^'iG) = L^'^'iG)-!. 

These identities are simply an expression of the fact that the degenerate bough of G that contains e becomes 
a nondegenerate bough in G with one free leaf. Moreover, the mapping Q \-^ Q clearly satisfies Claims (i) 
and (ii). That it is injective can be seen from the fact that Q can be reconstructed from Q, similarly to the 
construction given in the proof of Lemma 9.8. 

Choosing Q' = Y{Q') as above, we find that the bound (D.l) follows from (D.2) and the bound 



which is easy to check for all I ^ 1. 

Finally, the case when both G and G" satisfy (b) is dealt with exactly as the previous case. 



a 



E. List of concepts and symbols 



graph G, G", . • • 

stem S(G) of a graph G 

boughs B(G) of a graph G 

tagging T 

nonbacktracking encoding / 

decorated graph G,G' ,■ ■ ■ 

label x-u of a vertex v 
lumping r 

pairing H 



A rooted, oriented, unlabellcd tree graph; see p. 16. 

The path of G joining the vertices a{G) and b(G); sec p. 16. 

The subgraph of G that does not contain the stem of G; a collection 
of disjoint trees; see p. 16. 

A map from the edges of a graph to the set of tags encoding the 
contribution of an edge; see p. 16 as well as Figures 6.4 and 6.5. 

A map assigning to each pair of vertices v,w a number l{v, w) = 0, 1, 
used to encode any nonbacktracking conditions; see p. 17. 
A graph G together with a tagging tq of G and a map Iq imple- 
menting any nonbacktracking conditions; see p. 17. 

An element x„ e Apf assigned to v; see p. 17. 

An equivalence relation on the set of edges arising from taking the 
expectation value; see p. 7. 

The simplest type of lumping, whose equivalence classes each consist 
of two edges; see p. 29. 
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bridge tt 

(anti) parallel bridges 
twisted and straight bridges 
tagged pairing (E, 9) 

ladder i„ 

skeleton pairing S{Il, ■&) 

lonely leaf 

degenerate, bound, free leaves 

2IT 



& 

-' n 



deg{g) 

^(GUG") 

In 
T 

®: 

'^n.n' 

CO) 

^^ n.n' 

v^{g\jg',T) 



A lump of a pairing; see p. 29. 
See p. 30. 
See p. 29. 

A pairing 11 whose bridges tt carry a tag i?(7r) G {straight, twisted}; 

see p. 29. 

The simplest pairing, whose contribution is of leading order; see p. 

31. 

Tagged pairing obtained from the tagged pairing (11, d) by collapsing 

parallel straight bridges and antiparallel twisted bridges; see p. 8.2. 

A leaf that is the only edge of its lump; see Definition 9.5. 

See Definition 10.2. 

The set of graphs; see p. 16 and Figure 6.3. 

The set of decorated graphs; see p. 17. 

The set of decorated graphs corresponding to terms of the main path 
expansion (6.4); see Definition 6.4. 

The value of the decorated graph, a random variable; see (6.9). 

The operation that makes nonbacktracking the first backtracking 
stem vertex of a decorated graph; see p. 19. 

The operation that collapses the two stem neighbours of the first 
backtracking stem vertex of a decorated graph; see p. 19. 

The set of decorated graphs obtained from ^ € 25' by applying the 
operations Tn , Tn until the stem is completely nonbacktracking; see 
Definition 6.5. 
The union of all .'iSg for tj g ©'; see Definition 6.5. 

The degree of the decorated graph Q, representing the degree of the 
polynomial '^xy{Q)] see p. 17. 

The set of decorated graphs in ©j of degree n; see p. 25. 

The set of lumpings of the edges of G U G'; see p. 27. 

The graph in 20 that consists of a bare stem with n edges; see p. 27. 

The decorated graph in 6„ obtained by assigning the tag (s, 0) to 

all edges of /„; see p. 27. 

The set ©„ \ {!„}; see p. 27. 

An abbreviation for ^(/„ U /„'); see p. 27. 

The subset of ^„,„' consisting of pairings; see p. 29. 

The value of the lumping F G ^(GUG') of the edges of the decorated 
graph Q U Q'; see (7.6). 

A total order on the edges of G U G' describing the order used for 
summing out bough vertices; see pp. 38 and 53. 
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A A map that assigns to each bough edge e an edge Ag ^ e, and is 

used to parametrize the lumping of the bough edges; see p. 38. 
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